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Method for enrichment involving elimination by mismatch hybridisation 

All patent and non-patent references cited in the present application are 
hereby incorporated by reference in their entirety. This application is a non- 
5 provisional of U.S. provisional application Serial No. 60/544,343 

filed 17 February 2004, which is hereby incorporated by reference in its 
entirety. 

Technical Field of the Invention 

10 The present invention relates to a method for obtaining the identity of one or 
more display molecules capable of associating with a moieculantarget. The 
display molecules initially form part of a library and the present invention 
devises a method for identifying such display molecules that possess certain 
properties relative to a target. 

15 

Background 

Methods for obtaining information about a display molecule that possesses a 
binding characteristic towards a molecular target are used in the phage 
display area and are generally known as panning. Atypical panning protocol 
20 includes a library of phages displaying certain specific polypeptides and a 
target. When the library and the target are mixed, polypeptides that have an 
ability to bind to the target will form an association complex. Polypeptides 
that do not bind are washed away. 

25 In theory, it should be possible to rank the members of the library in 

accordance with their binding affinity in an elution step. Thus, the best 
binders could easily be identified following a single mixing step. However, 
practical experiments show that libraries of many members cannot properly 
be partitioned for good and bad binders and a second contacting with the 

30 target is necessary. Generally, libraries of the size 10 8 to 10 12 are used to 

increase the chance of finding a successfully binding polypeptide. Therefore, 
several rounds are generally required. To obtain a sufficient quantity of 
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phages for performing a second contacting with a molecular target, the 
phages harbouring the binding polypeptides are amplified using the genetic 
material (DNA or RNA) of the eluted binding phages. It is usual to perform 3 
to 12 rounds of contact between the increasingly enriched libraries and the 
5 target before a minor group of binding peptides can be identified. 

Systems that resemble the phage panning have been evolved by the present 
applicant (WO 02/103008 A2) however with the possibility of displaying other 
molecules than polypeptides. Other systems using the same principles as the 
10 phage panning include WO 98/31700, WO 93/03172, and WO 00/23458. 

For some systems, e.g. as disclosed in EP 643 778 B1 and WO 93/06121 
A1, it is not immediately possible to perform amplification of the complexes 
after elution of binding display molecules associated with an identifying 
15 nucleic acid. 

One known method for selecting useful molecules from a library is described 
in PCT application with publication no. WO 2004/099441 (Hyscite Discovery 
A/S), incorporated by reference herein. This method makes use of a primary 
20 library, which comprises the candidate molecules of the library marked with 
nucleic acid tags, and a secondary library, which is used for amplifying and 
identifying the nucleic acid tags of the molecules in the primary library. 

The present invention aims at providing a method that only requires a single 
25 initial contact between a target and a library of display molecules associated 
with an identifying nucleic acid even for large libraries. Thus, reiterated 
amplification of display molecules associated with an identifying or coding 
nucleic acid between each round of contact with the target can be avoided. 
Moreover, encoding methods not previous available for large libraries 
30 becomes at the disposal of the person skilled in the art. 
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Summary of the Invention 

The present invention relates to a method for obtaining display molecule(s) 
having affinity towards a target, comprising the steps of: a) providing a library 
comprising a plurality of different display molecules, each display molecule 
5 being associated with an identifier oligonucleotide, which codes for the 
identity of said display molecule, b) contacting the library with a target to 
allow for an interaction between the display molecules of the library with the 
target, c) partitioning a fraction preferably enriched with identifier 
oligonucleotides of display molecules interacting with the target, d) subjecting 
1 0 the fraction to conditions at which hetero-duplexes and homo-duplexes are 
formed, e) recovering the homo-duplexes, and f) deducing from the homo- 
duplexes the identity of the display molecule(s) interacting with the target. 

An ideal initial library comprises an equal amount of each display molecule 
15 associated with the identifier oligonucleotide. The display molecule 
associated with the identifier oligonucleotide is also for short termed 
"complex" in the following. It is estimated that a standard vial can comprise 
around 10 14 complexes. Therefore a library of 10 8 complexes ideally 
comprises 10 6 copies of each complex. 

20 

The contacting between the library and the molecular target, and the 
subsequent partitioning of a fraction of the complexes or just the identifier 
oligonucleotides thereof, interacting with the target forms an imbalance in the 
amount of the individual members of the library. If elution is used for the 
25 partitioning, complexes binding with high affinity to the target will be eluted in 
a relatively higher amounts compared to low affinity binding complexes, 
which will be eluted in a correspondingly minor amount. 

The obtained imbalance at the nucleic acid level is used in the subsequent 
30 steps. The identifier oligonucleotide parts of the binding complexes are 

usually, but not necessarily, amplified by PCR or similar to obtain a higher 
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total amount of oligonucleotides while retaining the proportion between the 
individual oligonucleotides. 

The PCR amplification provides a homo-duplex product. A mixture of homo- 
5 and hetero-duplexes is appropriately initiated by subjecting the PCR 

amplification product to a denaturing process to separate the duplexes into 
single stranded oligonucleotides. Subsequently, the mixture of single 
stranded oligonucleotides is allowed to hybridise to form the mixture of homo- 
and hetero-duplexes. The denaturing step is suitably obtained by heating the 

1 0 nucleic acids above the melting temperature of the duplexes and the 

hybridisation step is suitably conducted by lowering the temperature below 
the melting temperature of the duplexes. Due to the presence of various 
different single stranded oligonucleotides some oligonucleotides may find a 
perfect matching partner and form a homo-duplex, while other 

15 oligonucleotides will hybridise to non-complementary binding partners and 

form hetero-duplexes. The present invention takes advantage of the fact that 
the oligonucleotides most abundant easier will find perfect binding partners 
and form homo-duplexes. 

20 After the formation of the mixture of homo- and hetero-duplexes has been 

allowed the homo-duplexes are recovered. Recovery is usually conducted by 
eliminating or reducing the amount of hetero-duplexes. Several methods are 
available for reducing the amount of hetero-duplexes, including dHPLC as 
disclosed in US 5,795,976, and enzymatic degradation. In some aspects of 

25 the invention enzymatic degradation is preferred due to the availability of 
enzymes specifically locating one or more mis-match pairing nucleobases. 

The recovered pool of homo-duplexes may not fully be depleted for hetero- 
duplexes and/or the diversity of the pool may still be too high for a meaningful 
30 decoding. In an aspect of the invention, the step of forming a mixture of 

homo- and hetero-duplexes is therefore repeated one or more times, i.e. the 
recovered pool is, optionally after nucleic acid amplification, subjected to 
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denaturing conditions and subsequent to hybridisation conditions to allow for 
the formation of a new mixture of homo- and hetero-duplexes. The new 
mixture is treated as stated above to recover the homo-duplexes. 

5 The identity of the display molecules that possessed the ability of binding to 
the molecular target is finally revealed by decoding the homo-duplexes, using 
conventional methods. 

Brief Description of the Figures 

10 Fig. 1 discloses a schematic representation of a selection process. 
Fig. 2 depicts a homo- and a hetero-duplex. 

Fig. 3 shows a library treated in accordance with the method of the invention. 
Fig. 4 discloses the overall principle of mis-match selection. 

15 Detailed Description of the Invention 

The target may be of a biological origin or may be synthetic molecular target. 
Typically, the molecular target stems from an organism selected from human 
and animals, especially vertebras. However, in other embodiments the target 
may originate from a plant. In the quest for a compound with therapeutical 

20 effect on the human or animal body, the target is usually expected to have an 
importance in a therapeutically theory that combats a certain disease. In the 
quest for discovering compounds with plant protective effect, the target is 
usually expected to originate from an organism that harms the crop or a 
competing undesired plant. The organism may be a fungus when a 

25 compound with fungicide effect is searched for or an insect when a 

compound having insecticide effect is desired. Optionally, a protein target 
stemming from a biological origin may be derivatised by altering, adding, or 
deleting one or more amino acids. 

30 The target may be a protein, a small molecular hormone, a lipid, a 

polysaccharide, a whole cell, a nucleic acid, a metabolite, a heme group, etc. 
In a preferred aspect the target is a protein. The protein may serve the 
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function in the organism of being an enzyme, a hormone, a structural 
element, a regulatory protein, a membrane channel or pump, a part of a 
signal transducing cascade, an antibody, etc. Suitable target enzymes 
include kinases, phosphatates, and proteases. The protein may occur as an 

5 independent entity or may be dimers, trimers, tetramers, or polymers and the 
protein may comprise a prosthetic group. Furthermore, the molecular target 
may be a soluble or insoluble agglomerate of one or more proteins and one 
or more substituents occurring in the body or artificial components. In another 
preferred embodiment, the molecular target is a nucleic acid, such as DNA or 

10 RNA aptamer or ribozyme. 

The target may be immobilized to a solid support. The solid support can be a 
bead or the surfaces of a well. The target immobilized on the solid support 
may also form a stable or quasi-stable dispersion in the media; for example 
15 the immobilized target may be covalently (or otherwise) coupled to beads or 
matrices formed by a polymer such as a polymer selected from the group 
consisting of: 

polystyrene, polyethyleneglycol, a-peptides, (5-peptides, y-peptides, co- 
peptides, mono-, di- and tri-substituted a-peptides, p-peptides, y-peptides, o> 

20 peptides, peptides wherein the amino acid residues are in the L-form or in the 
D-form, vinylogous polypeptides, glycopoly-peptides, polyamides, vinyiogous 
sulfonamide peptides, polysulfonamides, conjugated peptides comprising 
e.g. prosthetic groups, polyesters, polysaccharides, polycarbamates, 
polycarbonates, polyureas, polypeptidylphosphonates, polyurethanes, 

25 azatides, oligo N-substituted glycines, polyethers, ethoxyformacetal 

oligomers, poly-thioethers, polyethylene glycols (PEG), polyethylenes, 
polydisulfides, polyarylene sulfides, polynucleotides, PNAs, LNAs, 
morpholinos, oligo pyrrolinones, polyoximes, polyimines, polyethyleneimines, 
polyimides, polyacetals, polyacetates, polystyrenes, polyvinyl, lipids, 

30 phospholipids, glycoiipids, polycyclic compounds comprising e.g. aliphatic or 
aromatic cycles, including polyheterocyciic compounds, proteoglycans, and 
polysiloxanes, including any combination thereof. 
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More preferably, said polymer is polystyrene or polyethyleneglycol 

In a certain embodiment, the target is in solution and all the interaction occur 
in the solution too. The absence of an immobilization step generally reduces 

5 the background noise because there is no background surface to associate 
to. Thus the result of the assay may be more sensitive. In solution, the only 
background noise imaginable is when the oligonucleotides or display 
molecules of the library of complexes binds unspecific to the target 
molecules. The absence of an immobilization step generally necessitates a 

10 subsequent recovery step, e.g. by chromatography. 

1n certain aspects of the invention, it is preferred to immobilize the target on a 
solid support. The solid support may be beads of a column or the surface of a 
container. The immobilisation of the molecular target may ease the removal 

15 of the non-binding complexes by washing or similar means. In a certain 

embodiment, a cleavable linkage between the molecular target and the solid 
support is present. The cleavable linker is preferably selectively cleavable, 
that is, the linkage can be cleaved without cleaving other linkages in the 
target or the complexes. The cleavage of the linkage between the molecular 

20 target and the solid support may reduce the contribution from the 

background, such as complexes associated with the surface of the solid 
support and not binding to the molecular target. 

The target may be obtained in any suitable way. A variety of targets are 
25 commercially available, either as purified protein or as the corresponding 
cDNA. Other protein or peptide targets may be isolated from tissues or 
mRNA (or the corresponding cDNA) may be extracted from a tissue. Smaller 
peptides may be synthesised chemically using the standard solid-phase 
Fmoc peptide synthesis. When nucleic acids are used or included in the 
30 molecular target, it may be synthesised using the standard amedite synthesis 
method or by using the natural machinery. 
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It may be an advantage to have all or at least a part of the identifier 
oligonucleotides on a double stranded form during the contacting with the 
molecular target, as certain nucleic acids may perform a binding interaction 
or a catalytical action on the components present during the contacting step. 
5 Thus, in one embodiment of the invention, the identifier oligonucleotide partly 
or fully is hybridised to a complementing oligonucleotide. 

The identifier oligonucleotide comprises the information necessary for 
decoding the identity of the display molecule. The identifier oligonucleotide 

10 may be analysed directly in some instances to reveal the identity of the 
display molecules that have performed an interaction with the target. The 
informative part can be decoded in a standard sequencing machine. In 
general however, it is preferred to include the informative part of the coupled 
product in to a suitable vector and transfer the vector to a host organism. The 

1 5 host organisms may then be cultivated on a suitable substrate and allowed to 
form colonies. Samples from the colonies may be used for sequencing in a 
sequencing machine. Also, the identification may be conducted using any 
sequencing method known in the art, including QPCR, microarrays, etc. 

20 Display molecule and identifier oligonucleot ide association 

The display molecule associated with the identifier oligonucleotide is 
sometimes herein referred to as a Afunctional complex to indicate that a 
physical connection between the display molecule and the identifier 
oligonucleotide normally is present. However, in certain embodiments of the 

25 present invention the association between the display molecule and the 
identifier oligonucleotide may be spatial, i.e. an identifier oligonucleotide 
specifies the spatial position of a display molecule. The term "bifunctional 
complex" is intended also b cover the latter embodiment. The identifier 
oligonucleotide comprises identifying moieties that identify the display 

30 molecule. Preferably, the identifier oligonucleotide identifies the molecule 

uniquely, i.e. in a library of complexes a particular identifier oligonucleotide is 
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capable of distinguishing the molecule it is associated with from the rest of 
the display molecules. 

The display molecule and the identifier oligonucleotide may be attached 
5 directly to each other or through a bridging moiety. In one aspect of the 
invention, the bridging moiety is a selectively cleavable linkage. 

The identifier oligonucleotide may comprise one, two or more codons. The 
codon sequences can be decoded to identify reactants used in the formation 
10 of the display molecule. When the identifier oligonucleotide comprises more 
than one codon, each member of a pool of chemical entities can be identified 
and the order of codons is informative of the synthesis step each member 
has been incorporated in. 

15 The sequence of the nucleotides in each codon may have any suitable 

length. The codon may be a single nucleotide or a plurality of nucleotides. In 
some aspects of the invention, it is preferred that each codon independently 
comprises four or more nucleotides, more preferred 4 to 30 nucleotides. 

20 The identifier oligonucleotide will in general have at least two codons 

arranged in sequence, i.e. next to each other. Two neighbouring codons may 
be separated by a framing sequence. Depending on the display molecule 
formed, the identifier oligonucleotide may comprise further codons, such as 
3, 4, 5, or more codons. Each of the further codons may be separated by a 

25 suitable framing sequence. Preferably, all or at least a majority of the codons 
of the identifier oligonucleotide are separated from a neighbouring codon by 
a framing sequence. The framing sequence may have any suitable number 
of nucleotides, e.g. 1 to 20. Alternatively, codons on the identifier 
oligonucleotide may be designed with overlapping sequences. 

30 

The framing sequence, if present, may serve various purposes. In one setup 
of the invention, the framing sequence identifies the position of the codon. 
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Usually, the framing sequence either upstream or downstream of a codon 
comprises information, which allows determination of the position of the 
codons. In another setup, the frames have alternating sequences, allowing 
for addition of building blocks from two pools in the formation of the library. 
5 The framing sequence may also or in addition provide for a region of high 
affinity. The high affinity region may ensure that the hybridisation of the 
template with an anti-codon will occur in frame. Moreover, the framing 
sequence may adjust the annealing temperature to a desired level. 

10 By a molecule having an "affinity" for molecule X is meant herein that a 
molecule with affinity for molecule X will bind to molecule X in a certain 
detectable amount under certain conditions but will not (optionally detectably) 
bind other, different molecules (for which it does not have affinity for) to the 
same extent under identical conditions. One measure to describe a 

15 molecule's affinity to another molecule is a dissociation constant, Kd. The 
smaller the Kd, the stronger the affinity. Dissociation constants can be 
determined using methods well-known in the art, such as surface plasmon 
resonance analysis. Herein, it is preferred that a molecule with "affinity" for 
another molecule X has a Kd for said molecule X that is less than 100 mM, 

20 such as less than 10 mM, for example less than 5 mM, such as less than 1 
mM, for example less than 0.1 mM, such as less than 0.01 mM, for example 
less than 1 uM , such as less than 100 nM, for example less than 10 nM, 
such as less than 1 nM, , for example less than 100 pM , such as less than 
10 pM, for example less than 1 pM. Furthermore, it is herein preferred that a 

25 molecule that "does not have an affinity" to molecule X has a dissociation 

constant, Kd with respect to binding molecule X that is at least 10 fold larger, 
such as at least 20 fold larger, for example at least 30 fold larger, such as at 
least 40 fold larger, for example at least 50 fold larger, such as at least 60 
fold larger, for example at least 70 fold larger, such as at least 80 fold larger, 

30 for example at least 90 fold larger, such as at least 100 fold larger, than the 
Kd of the binding (to molecule X) of a molecule that does have affinity to 
molecule X. Most preferably, there is at least a ten-fold difference in Kd 
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between those molecules considered to have an affinity and those deemed 
not to have an affinity to a molecule X. 

A framing sequence with high affinity can be provided by incorporation of one 
5 or more nucleobases forming three hydrogen bonds to a cognate 

nucieobase. Examples of nucleobases having this property are guanine and 
cytosine. Alternatively, or in addition, the framing sequence may be subjected 
to backbone modification. Several back bone modifications provides for 
higher affinity, such as 2-O-methyI substitution of the ribose moiety, peptide 
10 nucleic acids (PNA), and 2'-4' O-methylene cyclisation of the ribose moiety, 
also referred to as LNA (Locked Nucleic Acid). 

The identifier oligonucleotide may comprise one or two flanking regions. The 
flanking region can encompass a signal group, such as a flourophor or a 
1 5 radioactive group to allow for detection of the presence or absence of a 

complex or the flanking region may comprise a label that may be detected, 
such as biotin. When the identifier oligonucleotide comprises a biotin moiety, 
the identifier oligonucleotide may easily be recovered. 

20 The flanking region(s) can also serve as priming sites for amplification 
reactions, such as PCR. The identifier oligonucleotide may in certain 
embodiments comprise an affinity region having the property of being able to 
hybridise to a building block. The priming sites for PCR amplification may be 
identical for all the identifier oligonucleotides to allow for a proportional 

25 amplification of the individual identifier oligonucleotides at various stages of 
the present method. Alternatively, the different priming sites with 
corresponding primers may be used to favour the amplification of certain 
groups of complexes in the library. 

30 It is to be understood that when the term identifier oligonucleotide is used in 
the present description and claims, the identifier oligonucleotide may be in 
the sense or the anti-sense format, i.e. the identifier oligonucleotide can be a 
sequence of codons, which actually codes for the molecule or can be a 
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sequence complementary thereto. Moreover, the identifier oligonucleotide 
may be single-stranded or double-stranded, as appropriate. 

During the contacting step the identifier oligonucleotide usually is in double 
5 stranded form to minimise any interaction relative to the target. However, it 
may be suitable to establish the identifier oligonucleotide in single stranded 
form prior to the formation of the mixture of the hetero- and homo-duplexes. 
Starting with a double stranded identifier oligonucleotide, a single stranded 
identifier oligonucleotide may easily be prepared by extension of a forward 
1 0 primer annealed at a priming site of the identifier oligonucleotide. 

The display molecule part of the complex is generally of a chemical structure 
expected of having an effect on the target. When the target is of 
pharmaceutical importance, the molecule is generally a possible drug 

1 5 candidate. The complex may be formed by tagging a library of different 

possible drug candidates with a tag, e.g. a nucleic acid tag identifying each 
possible drug candidate. In another embodiment of the invention, the 
molecule is encoded, i.e. formed by a variety of reactants, which have 
reacted with each other and/or a scaffold molecule. Optionally, this reaction 

20 product may be post-modified to obtain the final molecule displayed on the 
complex. The post-modification may involve the cleavage of one or more 
chemical bonds attaching the encoded molecule to the identifier in order 
more efficiently to display the encoded molecule. In still another embodiment 
the display molecule is a polypeptide formed using the natural machinery, 

25 such as the methods disclosed in WO 92/02536, WO 91/05058, and US 
6,194,550. 

A variety or methods for association of an oligonucleotide to a polypeptide 
display molecule is available for the skilled person in the art. An option 
30 involves the association of a display molecule protein with the mRNA 

responsible for the formation thereof. This method is generally referred to as 
mRNA display. Optionally, the mRNA may be substituted with the 
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corresponding cDNA. A method for generation such a single or a library of 
fusions between a protein and the mRNA responsible for the formation 
thereof is disclosed in WO 98/31700. The corresponding DNA strand may be 
attached to the protein using the method disclosed in WO 00/32823. The 
5 contents of both patent applications being incorporated in their entirety by 
reference herein. The method of WO 98/31700 includes providing a RNA 
stand comprising a translation initiation sequence, a start codon operable 
linked to a protein encoding sequence, and a peptide acceptor at the 3' end 
and translating the protein encoding sequence to produce a RNA-protein 

10 fusion. According to WO 00/32823 a DNA primer is covalently connected to 
the 3' end of the mRNA strand and extended by reverse transcriptase a to 
prepare the complementing DNA strand. The original RNA strand may be 
digested by RNase H. Another suitable method for generating a library is 
disclosed in WO 01/90414, the content of which is incorporated herein by 

15 reference. 

In accordance with another option, the identifier oligonucleotide is associated 
with a polypeptide display molecule using a method generally referred to as 
ribosome display. Ribosome display is disclosed in WO 93/03172, the 
20 content of which is included herein by reference. A still further option for 
association is phage display, in which a polypeptide display molecule is 
presented on the capsule of the phage and the same capsule harbours the 
RNA or DNA responsible for the formation of the polypeptide. 

25 A further option for associating the display molecule with an identifier 

oligonucleotide includes the method disclosed in M. Yonezawa et al, Nucleic 
acid research, 2003, vol. 31, No. 19 e118 (included by reference). The 
method includes the initial provision of an oligonucleotide connected to biotin 
and compartmentalization thereof together with a transcription and translation 

30 system. The oligonucleotide comprises a fusion gene coding for streptavidin 
and a polypeptide display molecule. After the formation of the fusion protein 
in each compartment, the streptavidin part of the fusion protein binds to the 
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biotin moiety of the oligonucleotide, thereby associating the display molecule 
with the oligonucleotide coding for the identity thereof. 

In case the display molecule is a nucleic acid, it may be of the aptamer type, 
5 i.e. a library of aptamers comprising constant nucleic acid regions flanking a 
random oligonucleotide part. The random oligonucleotide part serves the 
dual function of a nucleic acid display molecule and the identifying 
oligonucleotide. 

10 The formation of a synthetic encoded molecule generally starts by a scaffold, 
i.e. a chemical unit having one or more reactive groups capable of forming a 
connection to another reactive group positioned on a chemical entity, thereby 
generating an addition to the original scaffold. A second chemical entity may 
react with a reactive group also appearing on the original scaffold or a 

1 5 reactive group incorporated by the first chemical entity. Further chemical 

entities may be involved in the formation of the final reaction product. The , 
formation of a connection between the chemical entity and the nascent 
encoded molecule may be mediated by a bridging molecule. As an example, 
if the nascent encoded molecule and the chemical entity both comprise an 

20 amine group a connection between these can be mediated by a dicarboxyiic 
acid. A display molecule is in general produced in vitro and may be a 
naturally occurring or an artificial substance. In an aspect of the invention, a - 
display molecule is not produced using the natural translation system in an in 
vitro process. In other aspects of the invention, the display molecule is a 

25 polypeptide produced using the natural translation machinery. 

The chemical entities that are precursors for structural additions or 
eliminations of the encoded molecule may be attached to a building block 
prior to the participation in the formation of the reaction product leading to the 
30 final display molecule. Besides the chemical entity, the building block 

generally comprises an anti-codon. In some embodiments the building blocks 
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also comprise an affinity region providing for affinity towards the. nascent 
complex. 

in a certain aspect of the invention, the reactants or chemical entities are 
5 suitably mediated to the nascent encoded molecule by a building block, 

which further comprises an anticodon. The anti-codon serves the function of 
transferring the genetic information of the building block in conjunction with 
the transfer of a chemical entity. The transfer of genetic information and 
chemical entity may occur in any order, however, it is important that a 
10 correspondence is maintained in the complex. The chemical entities are 
preferably reacted without enzymatic interaction in some aspects of the 
invention. Notably, ribosomes or enzymes having similar activity do 
preferably not mediate the reaction of the chemical entities. In another aspect 
of the invention a ribosome is used to translate an mRNA into a protein using 
15 a tRNA loaded with a natural or unnatural amino acid. In still another aspect 
of the invention, enzymes having catalytic activities different from that of 
ribosomes are used in the formation of the display molecule. 

According to certain aspects of the invention the genetic information of the 
20 anti-codon is transferred by specific hybridisation to a codon on a nucleic 

acid template. Another method for transferring the genetic information of the 
anti-codon to the nascent complex is to anneal an oligonucleotide 
complementary to the anti-codon and attach this oligonucleotide to the 
complex, e.g. by ligation. A still further method involves transferring the 
25 genetic information of the anti-codon to the nascent complex by an extension 
reaction using a polymerase and a mixture of dNTPs. 

The chemical entity of the building block may in certain cases be regarded as 
a precursor for the structural entity eventually incorporated into the encoded 
30 molecule. In other cases the chemical entity provides for the eliminations of 
chemical units of the nascent encoded molecule. Therefore, when it in the 
present application with claims is stated that a chemical entity is reacted with 
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a nascent encoded molecule it is to be understood that not necessarily all the 
atoms of the original chemical entity is to be found in the eventually formed 
encoded molecule. Also, as a consequence of the reactions involved in the 
connection, the structure of the chemical entity can be changed when it 
5 appears on the nascent encoded molecule. Especially, the cleavage resulting 
in the release of the entity may generate a reactive group, which in a 
subsequent step can participate in the formation of a connection between a 
nascent complex and a chemical entity. 

1 0 The chemical entity of the building block comprises at least one reactive 
group capable of participating in a reaction, which results in a connection 
between the chemical entity of the building block and another chemical entity 
or a scaffold associated with the nascent complex. The number of reactive 
groups, which appears on the chemical entity, is suitably one to ten. A 

15 building block featuring only one reactive group is used La. in the end 
positions of polymers or scaffolds, whereas building blocks having two 
reactive groups are suitable for the formation of the body part of a polymer or 
scaffolds capable of being reacted further. One, two or more reactive groups 
intended for the formation of connections are typically present on scaffolds. 

20 Non-limiting examples of scaffolds are opiates, steroids, benzodiazepines, 
hydantoines, and peptidylphosphonates. 

The reactive group of the chemical entity may be capable of forming a direct 
connection to a reactive group of the nascent complex or the reactive group 
25 of the building block may be capable of forming a connection to a reactive 
group of the nascent complex through a bridging fill-in group. It is to be 
understood that not all the atoms of a reactive group are necessarily 
maintained in the connection formed. Rather, the reactive groups are to be 
regarded as precursors for the structure of the connection. 

30 

The subsequent cleavage step to release the chemical entity from the 
building block can be performed in any appropriate way. In an aspect of the 
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invention the cleavage involves usage of a chemical reagent or an enzyme. 
The cleavage results in a transfer of the chemical entity to the nascent 
encoded molecule or in a transfer of the nascent encoded molecule to the 
chemical entity of the building block. In some cases it may be advantageous 
5 to introduce new chemical groups as a consequence of linker cleavage. The 
new chemical groups may be used for further reaction in a subsequent cycle, 
either directly or after having been activated. In other cases it is\desirable 
that no trace of the linker remains after the cleavage. 

10 In another aspect, the formation of connection between chemical entity and 
nascent encoded molecule and the cleavage between chemical entity and 
the remainder of the building block is conducted as a simultaneous reaction, 
i.e. either the chemical entity of the building block or the nascent encoded 
molecule is a leaving group of the reaction. In some aspects of the invention, 

15 it is appropriate to design the system such that the connection and the 

cleavage occur simultaneously because this will reduce the number of steps 
and the complexity. The simultaneous connection and cleavage can also be 
designed such that either no trace of the linker remains or such that a new 
chemical group for further reaction is introduced, as described above. 

20 

The attachment of the chemical entity to the building block, optionally via a 
suitable spacer can be at any entity available for attachment, e.g. the 
chemical entity can be attached to a nucleobase or the backbone, in general, 
it is preferred to attach the chemical entity at the phosphor of the 

25 internucleoside linkage or at the nucleobase. When the nucleobase is used 
for attachment of the chemical entity, the attachment point is usually at the 7 
position of the purines or 7-deaza-purins or at the 5 position of pyrimidines. 
The nucleotide may be distanced from the reactive group of the chemical 
entity by a spacer moiety. The spacer may be designed such that the 

30 conformational spaced sampled by the reactive group is optimized for a 
reaction with the reactive group of the nascent encoded molecule. 
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The display molecules of the invention may have any chemical structure. In a 
preferred aspect, the display molecule can be any compound that may be 
synthesized in a component-by-component fashion. In some aspects the 
display molecule is a linear or branched polymer. In another aspect the 
display molecule is a scaffolded molecule. The term "display molecule" also 
comprises naturally occurring molecules like a-polypeptides etc, however 
produced in vitro usually in the absence of enzymes, like ribosomes. In 
certain aspects, the display molecule of the library is a non-a-polypeptide. 

1 0 The display molecule may have any molecular weight. However, in order to 
be orally available, it is in this case preferred that the display molecule has a 
molecular weight less than 2000 Daltons, preferably less than 1000 Dalton, 
and more preferred less than 500 Daltons. 

15 The size of the library may vary . considerably pending on the expected result 
of the inventive method. In some aspects, it may be sufficient that the library 
comprises two, three, or four different complexes. However, in most events, 
more than two different complexes are desired to obtain a higher diversity. In 
some aspects, the library comprises 1,000 or more different complexes; more 

20 preferred 1 ,000,000 or more different complexes. The upper limit for the size 
of the library is only restricted by the size of the vessel in which the library is 
comprised. It may be calculated that a vial may comprise up to 10 14 different 
complexes. 

25 Methods for forming libraries of complexes 

The complexes comprising an identifier oligonucleotide having two or more 
codons that codes for reactants that have reacted in the formation of the 
molecule part of the complex may be formed by a variety of processes. 
Generally, the preferred methods can be used for the formation of virtually 

30 any kind of encode molecule. Suitable examples of processes include prior 
art methods disclosed in WO 93/20242, WO 93/06121 , WO 00/23458, WO 
02/074929, and WO 02/103008, the content of which being incorporated 
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herein by reference as well as methods of the present applicant not yet public 
available, including the methods disclosed in PCT/DK03/00739 filed 30 
October 2003, and DK PA 2003 00430 filed 20 March 2003. Any of these 
methods may be used, and the entire content of the patent applications are 
5 included herein by reference. 

Below five presently preferred embodiments are described. A first 
embodiment disclosed in more detail in WO 02/103008 is based on the use 
of a polymerase to incorporate unnatural nucleotides as building blocks. 

10 Initially, a plurality of template oligonucleotides is provided. Subsequently 

primers are annealed to each of the templates and a polymerase is extending 
the primer using nucleotide derivatives, which have appended chemical 
entities. Subsequent to or simultaneously with the incorporation of the 
nucleotide derivatives, the chemical entities are reacted to form a reaction 

15 product. The encoded molecule may be post-modified by cleaving some of 
the linking moieties to better present the encoded molecule. 

Several possible reaction approaches for the chemical entities are apparent. 
First, the nucleotide derivatives can be incorporated and the chemical entities 

20 subsequently polymerised. In the event the chemical entities each carry two 
reactive groups, the chemical entities can be attached to adjacent chemical 
entities by a reaction of these reactive groups. Exemplary of the reactive 
groups are amine and carboxylic acid, which upon reaction form an amide 
bond. Adjacent chemical entities can also be linked together using a linking 

25 or bridging moiety. Exemplary of this approach is the linking of two chemical 
entities each bearing an amine group by a bi-carboxylic acid. Yet another 
approach is the use of a reactive group between a chemical entity and the 
nucleotide building block, such as an ester or a hoister group. An adjacent 
building block having a reactive group such as an amine may cleave the 

30 interspaced reactive group to obtain a linkage to the chemical entity, e.g. by 
an amide linking group. 
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A second embodiment for obtainment of complexes disclosed in WO 
02/103008 pertains to the use of hybridisation of building blocks to a template 
and reaction of chemical entities attached to the building blocks in order to 
obtain a reaction product. This approach comprises that templates are 
5 contacted with a plurality of building blocks, wherein each building block 
comprises an anti-codon and a chemical entity. The anti-codons are 
designed such that they recognise a sequence, i.e. a codon, on the template. 
Subsequent to the annealing of the anti-codon and the codon to each other a 
reaction of the chemical entity is effected. 

10 

The template may be associated with a scaffold. Building blocks bringing 
chemical entities in may be added sequentially or simultaneously and a 
reaction of the reactive group of the chemical entity may be effected at any 
time after the annealing of the building blocks to the template. 

15 

A third embodiment for the generation of a complex includes chemical or 
enzymatic ligation of building blocks when these are lined up on a template. 
Initially, templates are provided, each having one or more codons. The 
templates are contacted with building blocks comprising anti-codons linked to 
20 chemical entities. The two or more anti-codons annealed on a template are 
subsequently ligated to each other and a reaction of the chemical entities is 
effected to obtain a reaction product. The method is disclosed in more detail 
in DK PA 2003 00430 filed 20 March 2003. 

25 A fourth embodiment makes use of the extension by a polymerase of an 
affinity sequence of the nascent complex to transfer the anti-codon of a 
building block to the nascent complex. The method implies that a nascent 
complex comprising a scaffold and an affinity region is annealed to a building 
block comprising a region complementary to the affinity section. 

30 Subsequently, the anti-codon region of the building block is transferred to the 
nascent complex by a polymerase. The transfer of the chemical entity may 



10 
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be transferred prior to, simultaneously with or subsequent to the transfer of 
the anti-codon. This method is disclosed in detail in PCT/DK03/00739. 

A fifths embodiment also disclosed in PCT/DK03/00739 comprises reaction 
of a reactant with a site reaction site on nascent Afunctional molecule and 
addition of a nucleic acid tag to the nascent bifunctional molecule using an 
enzyme, such as a ligase. When a library is formed, usually an array of 
compartments is used for reaction of reactants and enzymatic addition of 
tags with the nascent bifunctional molecule. 

Thus, the codons are either pre-made.into one or more templates before the 
encoded molecules are generated or the codons are transferred 
simultaneously with the formation of the encoded molecules. 

■par- . i ■ ■ 

1 5 After or simultaneously with the formation of the reaction product some of the 
linkers to the template may be cleaved, however, usually at least one linker is 
maintained to provide for the complex. 

Nucleotides 

20 The nucleotides used in the present invention may be iinked together in a 

sequence of nucleotides, i.e. an oligonucleotide. Each nucleotide monomer is 
normally composed of two parts, namely a nucleobase moiety, and a 
backbone. The backbone may in some cases be subdivided into a sugar 
moiety and an internucleoside linker. 

25 

The nucleobase moiety may be selected among naturally occurring 
nucleobases as well as non-naturally occurring nucleobases. Thus, 
"nucleobase" includes not only the known purine and pyrimidine hetero- 
cycles, but also heterocyclic analogues and tautomers thereof. Illustrative 
30 examples of nucleobases are adenine, guanine, thymine, cytosine, uracil, 
purine, xanthine, diaminopurine, 8-oxo-N 6 -methyladenine, 7-deazaxanthine, 
7-deazaguanine, N 4 ,N 4 -ethanocytosin, N 6 ,N 6 -ethano-2,6-diamino-purine, 5- 
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methylcytosine, 5-(C 3 -C 6 )-alkynylcytosine, 5-fluorouracil, 5-bromouracil, 
pseudoisocytosine, 2-hydroxy-5-methyl-4-triazolopyridine, isocytosine, 
isoguanine, inosine and the "non-naturally occurring" nucleobases described 
in Benner et al., U.S. Pat No. 5,432,272. The term "nucleobase" is intended 
5 to cover these examples as well as analogues and tautomers thereof. 

Especially interesting nucleobases are adenine, guanine, thymine, cytosine, 
5-methyloytosine, and uracil, which are considered as the naturally occurring 
nucleobases in relation to therapeutic and diagnostic application in humans. 



10 Examples of suitable specific pairs of nucleobases are shown below: 
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N atural Base Pairs 




Nl H 2 
N 



R = H : U racil 
R = CH 3 : Thymine 

N B ackbone 



B ackbone 



Adenine 



N 

Backbone 




H 2 N 



NH 
N^NH 2 



Cytosine 




N "^Backbone 



G ua nine 



Synthetic Base Pairs 




e 



B ackbone 



Synthetic purine bases pair ring with n atu ra 1 p y rim id in e s 

R = H : U racil 

R = CH 3 : Thymine Cytosine 




mm us. /""Backbone 

NH 2 H.N_^ 



Backbone m — >k, K , ,. 



Backbone N — ^n^nH 2 

B a ckbon e 

7-deaza adenine 7-deaza guanine 

Suitable examples of backbone units are shown below (B denotes a nucleobase): 



> SUBSTITUTE SHEET (RULE 26) 
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2'-(3-hydroxy)propyl 



The sugar moiety of the backbone is suitably a pentose but may be the 
5 appropriate part of a PNA or a six-member ring. Suitable examples of 

possible pentoses include ribose, 2'-deoxyribose, 2-O-methyl-ribose, 2*-flour- 
ribose, and 2-4'-0-methylene-ribose (LNA). Suitably the nucleobase is 
attached to the 1' position of the pentose entity. 

1 0 An internucleoside linker connects the 3" end of preceding monomer to a 5' 
end of a succeeding monomer when the sugar moiety of the backbone is a 
pentose, like ribose or 2-deoxyribose. The internucleoside linkage may be 
the natural occurring phospodiester linkage or a derivative thereof. 
Examples of such derivatives include phosphorothioate, methylphosphonate, 

1 5 phosphoramidate, phosphotriester, and phosphodithioate. Furthermore, the 
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internucleoside linker can be any of a number of non-phosphorous-containing 
linkers known in the art. 

Preferred nucleic acid monomers include naturally occurring nucleosides 
forming part of the DNA as well as the RNA family connected through 
phosphodiester linkages. The members of the DNA family include 
deoxyadenosine, deoxyguanosine, deoxythymidine, and deoxycytidine. The 
members of the RNA family include adenosine, guanosine, uridine, cytidine, 
and inosine. Inosine is a non-specific pairing nucleoside and may be used as 
universal base because inosine can pair nearly isoenergetically with A, T, 
and C. Other compounds having the same ability of non-specifically base- 
pairing with natural nucleobases have been formed. Suitable compounds 
which may be utilized in the present invention includes among others the 
compounds depicted below 
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Examples of Universal Bases: 
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Building block 

The chemical entities or reactants that are precursors for structural additions 
-or eliminations of the encoded molecule may be attached to a building block 
5 prior to the participation in the formation of the reaction product leading to the 
final encoded molecule. Besides the chemical entity, the building block 
generally comprises an anti-codon. 

The chemical entity of the building block comprises at least one reactive 
10 group capable of participating in a reaction, which results in a connection 

between the chemical entity of the building block and another chemical entity 
or a scaffold associated with the nascent complex. The connection is 
facilitated by one or more reactive groups of the chemical entity. The number 
of reactive groups, which appear on the chemical entity, is suitably one to 
15 ten. A building block featuring only one reactive group is used i.a. in the end 
positions of polymers or scaffolds, whereas building blocks having two 
reactive groups are suitable for the formation of the body part of a polymer or 
scaffolds capable of being reacted further. One, two or more reactive groups 
intended for the formation of connections are typically present on scaffolds. 

20 

The reactive group of the building block may be capable of forming a direct 
connection to a reactive group of the nascent complex or the reactive group 
of the building block may be capable of forming a connection to a reactive 
group of the nascent complex through a bridging fill-in group. It is to be 
25 understood that not all the atoms of a reactive group are necessarily 

maintained in the connection formed. Rather, the reactive groups are to be 
regarded as precursors for the structure of the connection. 

The subsequent cleavage step to release the chemical entity from the 
30 building block can be performed in any appropriate way. In an aspect of the 
invention the cleavage involves usage of a reagent or an enzyme. The 
cleavage results in a transfer of the chemical entity to the nascent encoded 
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molecule or in a transfer of the nascent encoded molecule to the chemical 
entity of the building block. In some cas^ i> it may be advantageous to 
introduce new chemical groups as a consequence of linker cleavage. The 
new chemical groups may be used for further reaction in a subsequent cycle, 
5 either directly or after having been activated. In other cases it is desirable 
that no trace of the linker remains after the cleavage. 

In another aspect, the connection and the cleavage are conducted as a 
simultaneous reaction, i.e. either the chemical entity of the building block or 

10 the nascent encoded molecule is a leaving group of the reaction. In general, 
it is preferred to design the system such that the connection and the 
cleavage occur simultaneously because this will reduce the number of steps 
and the complexity. The simultaneous connection and cleavage can also be 
designed such that either no trace of the linker remains or such that a new 

15 chemical group for further reaction is introduced, as described above. 

The attachment of the chemical entity- to the building block, optionally via a 
suitable spacer can be at any entity available for attachment, e.g. the 
chemical entity can be attached to a nucleobase or the backbone. In general, 

20 it is preferred to attach the chemical entity at the phosphor of the 

interniicleoside linkage or at the nucleobase. When the nucleobase is used 
for attachment of the chemical entity, the attachment point is usually at the 7 
position of the purines or 7-deaza-purins or at the 5 position of pyrimidines. 
The nucleotide may be distanced from the reactive group of the chemical 

25 entity by a spacer moiety. The spacer may be designed such that the 
conformational space sampled by the reactive group is optimized for a 
reaction with the reactive group of the nascent encoded molecule or reactive 
site. 

30 The anticodon complements the codon of the identifier oligonucleotide 

sequence and generally comprises the same number of nucleotides as the 



WO 2005/078122 



PCT7DK2005/000106 



29 

codon. The anticodon may be adjoined with a fixed sequence, such as a 
sequence complementing a framing sequence. 

Various specific building blocks are envisaged. Building blocks of particular 



Building blocks transferring a chemical entity to a recipient nucleophilic group 
The building block indicated below is capable of transferring a chemical entity 
(CE) to a recipient nucleophilic group, typically an amine group. The bold 

10 lower horizontal line illustrates the building block comprising an anti-codon 
and the vertical line illustrates a spacer. The 5-membered substituted N- 
hydroxysuccinimid (NHS) ring serves as an activator, i.e. a labile bond is 
formed between the oxygen atom connected to the NHS ring and the 
chemical entity. The labile bond may be cleaved by a nucleophilic group, e.g. 

1 5 positioned on a scaffold 



The 5-membered substituted N-hydroxysuccinimid (NHS) ring serves as an 
20 activator, i.e. a labile bond is formed between the oxygen atom connected to 
the NHS ring and the chemical entity. The labile bond may be cleaved by a 
nucleophilic group, e;g. positioned on a scaffold, to transfer the chemical 
entity to the scaffold, thus converting the remainder of the fragment into a 
leaving group of the reaction. When the chemical entity is connected to the 
25 activator through a carbonyl group and the recipient group is an amine, the 
bond formed on the scaffold will an amide bond. The above building block is 



5 



interest are shown below. 
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the subject of WO03078627A2, the content of which is incorporated herein in 
their entirety by reference. 

Another building block, which may form an amide bond, is 



R may be absent or N0 2 , CF 3 , halogen, preferably CI, Br, or I, and Z may be 
S or O. This type of building block is disclosed in WO03078626A2. The 
content of this patent application is incorporated herein in the entirety by 
reference. 

A nucleophilic group can cleave the linkage between Z and the carbonyl 
group thereby transferring the chemical entity -(C=0)-CE' to said 
nucleophilic group. 

Building blocks transferring a chemical entity to a recipient reactive group 
forming a C=C bond 

A building block as shown below is able to transfer the chemical entity to a 
recipient aldehylde group thereby forming a double bond between the carbon 
of the aldehyde and the chemical entity 



o 
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The above building block is disclosed in WO03078445A2, the content of 
which being incorporated herein in the entirety by reference. 

5 

Building blocks transferring a chemical entity to a recipient reactive group 
forming a C-C bond 

The below building block is able to transfer the chemical entity to a recipient 
group thereby forming a single bond between the receiving moiety, e.g. a 
10 scaffold, and the chemical entity. 



\ 

B- 

/ 



-CE 



15 



The above building block is disclosed in WO03078445A2, the content of 
which being incorporated herein in the entirety by reference. 

Another building block capable of transferring a chemical entity to a receiving 
reactive group forming a single bond is 
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-CE 



The receiving group may be a nucleophile, such as a group comprising a 
hetero atom, thereby forming a single bond between the chemical entity and 
5 the hetero atom, or the receiving group may be an electronegative carbon 
atom, thereby forming a C-C bond between the chemical entity and the 
scaffold. The above building block is disclosed in WO03078446A2, the 
content of which is incorporated herein by reference. 

1 0 The chemical entity attached to any of the above building blocks may be a 
selected from a large arsenal of chemical structures. Examples of chemical 
entities are 

H or entities selected among the group consisting of a CrC 6 alkyl, C2-C 6 
alkenyl, C 2 -C 6 alkynyl, C 4 -C 8 alkadienyl, C 3 -C 7 cycloalkyl, C 3 -C 7 
1 5 cycloheteroalkyl, aryl, and heteroaryl, said group being substituted with 0-3 
R 4 , 0-3 R 5 and 0-3 R 9 or C r C 3 alkylene-NR 4 2 , d-C 3 alkylene-NR 4 C(0)R 8 . 
d-C 3 alkylene-NR 4 C(0)OR 8 , C r C 2 alkylene-0-NR 4 2 , Ci-C 2 
alkylene-0-NR 4 C(0)R 8 . Ci-C 2 alkylene-0-NR 4 C(0)OR 8 substituted with 0-3 

R 9 . 

20 where R 4 is H or selected independently among the group 

consisting of C r C 6 alkyl, QrCe alkenyl, C2-C 6 alkynyl, C 3 -C 7 cycloalkyl, C 3 -C 7 
cycloheteroalkyl, aryl, heteroaryl, said group being substituted with 0-3 R 9 
and 

R 5 is selected independently from -N 3 , -CNO, -C(NOH)NH 2 , 
25 -NHOH, -NHNHR 6 , -C(0)R 6 , -SnR 6 3 , -B(OR 6 ) 2 , -P(O)(0R 6 ) 2 or the group 
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consisting of C^Cs alkenyl, C 2 -C 6 alkynyl, C 4 -C 8 alkadienyl said group being 
substituted with 0-2 R 7 , 

where R 6 is selected independently from H, Ci-C 6 alkyl, C 3 .C 7 
cycloalkyl, aryl or d-C 6 alkylene-aryl substituted with 0-5 halogen atoms 
5 selected from -F, -CI, -Br, and —I; and 

R 7 is independently selected from -N0 2) -COOR 6 , -COR 6 , -CN, 

-OSiR 6 3, -OR 6 and -NR 6 2 . 

R 8 is H, d-Ce alkyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 3 -C 7 
cycloalkyl, aryl or Ci-C 6 alkylene-aryl substituted with 0-3 substituents 
1 0 independently selected from -F, -CI, -N0 2 , -R 3 , -OR 3 , -SiR 3 3 

R 9 is =0, -F, -CI, -Br, -I, -CN, -N0 2 , -OR 6 , -NR 6 2 , -NR 6 -C(0)R 8 , 
-NR 6 -C(0)OR 8 , -SR 6 , -S(0)R 6 , -S(0) 2 R 6 , -COOR 6 , -C(0)NR 6 2 and 
-S(0) 2 NR 6 2 . 

1 5 Cross-link cleavage building blocks 

It may be advantageous to split the transfer of a chemical entity to a recipient 
reactive group into two separate steps, namely a cross-linking step and a 
cleavage step because each step can be optimized. A suitable building block 
for this two-step process is illustrated below: 



20 




A 



25 



Initially, a reactive group appearing on the functional entity precursor 
(abbreviated FEP) reacts with a recipient reactive group, e.g. a reactive 
group appearing on a scaffold, thereby forming a cross-link. Subsequently, a 
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cleavage is performed, usually by adding an aqueous oxidising agent such 
as l 2 , Br 2 , Cl 2 , H + , or a Lewis acid. The cleavage results in a transfer of the 
group HZ-FEP- to the recipient moiety, such as a scaffold. 

5 In the above formula 

Z is O, S, NR 4 
Q is N, CR 1 

P is a valence bond, O, S, NR 4 , or a group Cjw-arylene, C-i- 
6 alkylene, Ci. 6 0-alkylene, Ci- 6 S-alkylene, NR 1 -alkylene, Ci- 6 alkylene-0, d- 
10 6 alkylene-S option said group being substituted with 0-3 R 4 , 0-3 R 5 and 0-3 
R 9 orCi-C 3 alkylene-NR 4 2 , d-C 3 alkylene-NR 4 C(0)R 8 , d-C 3 
alkylene-NR 4 C(6)OR 8 , d-C 2 alkylene-0-NR 4 2> d-C 2 
alkylene-0-NR 4 C(0)R 8 , C1-C2 alkylene-0-NR 4 C(0)QR 8 substituted with 0-3 

15 B is a group comprising D-E-F, in which 

D is a valence bond or a group d. 6 alkylene, d. 6 alkenylene, d- 
6 alkynyiene, Cs-yarylene, or Cs-yheteroarylene, said group optionally being 
substituted with 1 to 4 group R 11 , 

E is, when present, a valence bond, O, S, NR 4 , or a group d- 
20 6 alkylene, d^alkenylene, d. 6 alkynylene, C 5 -7arylene, or Cs-yheteroarylene, 
said group optionally being substituted with 1 to 4 group R , 
F is, when present, a valence bond, O, S, or NR 4 , 
A is a spacing group distancing the chemical structure from the 
complementing element, which may be a nucleic acid, 
25 R 1 , R 2 , and R 3 are independent of each other selected among 

the group consisting of H, d-C 6 alkyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 4 -C 8 
alkadienyl, C 3 -C 7 cycloalkyl, C3-C7 cycloheteroalkyl, aryl, and heteroaryl, said 
group being substituted with 0-3 R 4 , 0-3 R 5 and 0-3 R 9 or C1-C3 
alkylene-NR 4 2 , C1-C3 alkylene-NR 4 C(0)R 8 , d-C 3 alkylene-NR 4 C(0)OR 8 , 
30 C r C 2 alkylene-0-NR 4 2) Ci-C 2 alkylene-0-NR 4 C(0)R 8 , d-C 2 
alkylene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 , 
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FEP is a group selected among the group consisting of H, C r C 6 
alkyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 4 -C 8 aikadienyl, C 3 -C 7 cycloalkyl, C3-C7 
cycloheteroalkyl, aryl, and heteroaryl, said group being substituted with 0-3 
R 4 , 0-3 R 5 and 0-3 R 9 or d-C 3 a!kylene-NR 4 2 , C r C 3 alkylene-NR 4 C(0)R 8 , 
5 d-C 3 alkylene-NR 4 C(0)OR 8 , d-C 2 alkylene-0-NR 4 2 , Ci-C 2 

alkylene-0-NR 4 C(0)R 8 , d-C 2 alkylene-0-NR 4 C(0)OR 8 substituted with 0-3 

R 9 , 

where R 4 is H or selected independently among the group 
consisting of C r C 6 alkyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 3 -C 7 cycloalkyl, C 3 -C 7 
10 cycloheteroalkyl, aryl, heteroaryl, said group being substituted with 0-3 R 9 
and 

R 5 is selected independently from -N 3 , -CNO, -C(NOH)NH 2 , 
-NHOH, -NHNHR 6 , -C(0)R 6 , -SnR 6 3 , -B(OR 6 ) 2 , -P(0)(OR 6 ) 2 or the group 
consisting of C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 4 -C 8 aikadienyl said group being 
15 substituted with 0-2 R 7 , 

where R 6 is selected independently from H, Ci-C 6 alkyl, C^d 
cycloalkyl, aryl or Ci-C 6 alkylene-aryl substituted with 0-5 halogen atoms 
selected from -F, -CI, -Br, and -I; and R 7 is independently selected from - 
N0 2 , -COOR 6 , -COR 6 , -CN, -OSiR 6 3 , -OR 6 and -NR 6 2 . 
20 R 8 is H, d-C 6 alkyl, C 2 -C 6 alkenyl, d-Ce alkynyl, C 3 -C 7 cycloalkyl, aryl or 
Ci-C 6 alkylene-aryl substituted with 0-3 substituents independently selected 
from -F, -CI, -N0 2 , -R 3 , -OR 3 , -SiR 3 3 

R 9 i s =0, -F, -CI, -Br, -l, -CN, -N0 2 , -OR 6 , -NR 6 2 , -NR 6 -C(0)R 8 , 
-NR 6 -C(0)OR 8 , -SR 6 , -S(0)R 6 , -S(0) 2 R 6 , -COOR 6 , -C(0)NR 6 2 and 
25 -S(0) 2 NR 6 2 . 

In a preferred embodiment Z is O or S, P is a valence bond, Q is CH, B is 
CH 2 , and R 1 , R 2 , and R 3 is H. The bond between the carbonyl group and Z is 
cleavable with aqueous l 2 . 



30 
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Contacting between target and library 

The contacting step, by which the library of. Afunctional molecules is 
subjected under binding conditions to a target, may be referred to as the 
enrichment step or the selection step, as appropriate, and includes the 
5 screening of the library for display molecules having predetermined desirable 
characteristics. Predetermined desirable characteristics can include binding 
to a target, catalytically changing the target, chemically reacting with a target 
in a manner which alters/modifies the target or the functional activity of the 
target, and covalently attaching to the target as in a suicide inhibitor. 

10 

In theory, display molecules of interest can be selected based on their 
properties using either physical or physiological procedures. The method 
preferred according to the present invention is to enrich molecules with 
respect to binding affinity towards a target of interest, in a certain 

15 embodiment, the basic steps involve mixing the library of complexes with the 
target of interest. The target can be attached to a column matrix or microtitre 
wells with direct immobilization or by means of antibody binding or other 
high-affinity interactions. In another embodiment, the target and displayed 
molecules interact without immobilisation of the target. Displayed molecules 

20 that bind to the target will be retained on this surface, while nonbinding 
displayed molecules in a certain aspect of the invention will be removed 
during a single or a series of wash steps. The identifier oligonucleotides of 
complexes bound to the target can then be recovered. It may be considered 
advantageously to perform a chromatography step after or instead of the 

25 washing step, notably in cases where the target is not immobilized. After the 
recovery of the identifier oligonucleotides they are optionally amplified before 
the decoding step. 

A significant reduction in background binders may be obtained with increased 
30 washing volumes, repeating washing steps, higher detergent concentrations 
and prolonged incubation during washing. Thus, the more volume and 
number of steps used in the washing procedure together with more stringent 
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conditions the more efficiently the non-binders and background binders will 
be removed. The right stringency in the washing step can also be used to 
remove low-affinity specific binders. However, the washing step will also 
remove wanted binders if too harsh conditions are used. 

5 

A blocking step, such as incubation of solid phase with skimmed milk 
proteins or other inert proteins and/or mild detergent such as Tween-20 and 
Triton X-100, may also be used to reduce the background. The washing 
conditions should be as stringent as possible to remove background binding 
10 but to retain specific binders that interact with the target. Generally, washing 
conditions are adjusted to maintain the desired affinity binders, e.g. binders in 
the micro molar, nanomolar, or picomoiar range. 

The present invention takes advantages of the fact that the identifier 
15 oligonucleotides of low-binding complexes will be in a low concentration 
compared to the identifier oligonucleotides of complexes binding with high 
affinity. The generated imbalance can be enhanced in the subsequent 
formation of a mixture of homo- and hetoro-duplexes and the recovery of 
homo-duplexes. 

20 

The target can be any compound of interest. E.g. the target can be a protein, 
peptide, carbohydrate, polysaccharide, glycoprotein, hormone, receptor, 
antigen, antibody, virus, substrate, metabolite, transition state analogue, 
cofactor, inhibitor, drug, dye, nutrient, growth factor, cell, tissue, etc. without 

25 limitation. Suitable targets include, but are not limited to, angiotensin 
converting enzyme, renin, cyclooxygenase, 5-iipoxygenase, IIL- 1 0 
converting enzyme, cytokine receptors, PDGF receptor, type II inosine 
monophosphate dehydrogenase, p-lactamases, integrin, proteases like factor 
Vila, kinases like Bcr-Abl/Her, phosphotases like PTP-1B, and fungal 

30 cytochrome P-450. Targets can include, but are not limited to, bradykinin, 
neutrophil elastase, the HIV proteins, including tat, rev, gag, int, RT, 
nucleocapsid etc., VEGF, bFGF, TGF£, KGF, PDGF, GPCR, thrombin, 
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substance P, IgE, sPLA2, red blood cells, glioblastomas, fibrin clots, PBMCs, 
hCG, lectins, selectins, cytokines, ICP4, complement proteins, etc. 

A target can also be a surface of a non-biological origin, such as a polymer 
5 surface or a metal surface. The method of the invention may then be used to 
identify suitable coatings for such surfaces. 

In a preferred embodiment, the desirable display molecule acts on the target 
without any interaction between the nucleic acid attached to the desirable 
10 encoded molecule and the target. In one embodiment, the bound complex- 
target aggregate can be partitioned from unbound complexes by a number of 
methods. The methods include nitrocellulose filter binding, column 
chromatography, filtration, affinity chromatography, centrifugation, and other 
well known methods. A preferred method is size-exclusion chromatography. 

15 

Briefly, the library of complexes is subjected to the target, which may include 
contact between the library and a column onto which the target is 
immobilised. Identifier oligonucleotides associated with undesirable display 
molecules, i.e. display molecules not bound to the target under the stringency 

20 conditions used, will pass through the column. Additional undesirable display 
molecules (e.g. display molecules which cross-react with other targets) may 
be removed by counter-selection methods. Desirable complexes are bound 
to the column. The target may be immobilized in a number of ways. In one 
embodiment, the target is immobilized through a cleavable physical link, such 

25 as one more chemical bonds. 

The complex may be provided with a cleavable linker at a position between 
the display molecule and the identifier oligonucleotide. When the target is 
immobilized, the cleavable linker of the complex is preferable orthogonal to 
30 the cleavable linker that attached the target to the solid support. The 

cleavable linker may be cleaved to separate the identifier oligonucleotides of 
complexes having affinity towards the targets. Just to mention a single type 
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of orthogonal cleavable linkages, one could attached to target to the solid 
support through a linkage that can be cleaved by a chemical agent, and the 
linker separating the display molecule and the identifier oligonucleotide may 
be selected as a photocleavable linkage. More specifically, the former linkage 
5 may be a disulphide bond that can be cleaved by a suitable reducing agent 
like DTT (dithiothreitol) and the latter linkage may be an o-nitrophenyl group. 

There are other partitioning and screening processes which are compatible 
with this invention that are known to one of ordinary skill in the art. Such 
10 known process may be used in combination with the present inventive 
method. In one embodiment, the complex-target aggregate can be 
fractionated by common methods and then each fraction is assayed for 
activity. The fractionization methods can include size, pH s hydrophobicity, 
etc. 

15 

Inherent in the present method is the selection of encoded molecules on the 
basis of a desired function; this can be extended to the selection of 
molecules with a desired function and specificity. Specificity can be required 
during the selection process by first extracting complexes which are capable 

20 of interacting with a non-desired "target" (negative selection, or counter- 
selection), followed by positive selection with the desired target. As an 
example, inhibitors of fungal cytochrome P-450 are known to cross-react to 
some extent with mammalian cytochrome P-450 (resulting in serious side 
effects). Highly specific inhibitors of the fungal cytochrome could be selected 

25 from a library by first removing those complexes capable of interacting with 
the mammalian cytochrome, followed by retention of the remaining products 
which are capable of interacting with the fungal cytochrome. 

In a certain embodiment, a binding platform may be constructed that can be 
30 used for almost any target. The binding platform should preferably be small 
enough to only allow association of a few or a single target molecule. This to 
ensure a solution based selection procedure with adjustable target 
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concentration. The binding platform is primarily composed of two 
components; a small surface allowing association of the target molecule, and 
an association area/site for the target oligonucleotide. This binding platform 
may be designed to mediate the association of the target and target 
5 oligonucleotide to allow proximity selection in solution. 

Cleavable linkers 

A cleavable linker may be positioned between the target and a solid support, 
between the display molecule and the identifier oligonucleotide, or any other 
1 0 position that can provide for a separation of the identifier oligonucleotides of 
successful complexes from non-specific binding complexes. The cleavable 
linker may be selectively cleavable, i.e. conditions may selected that only 
cleave that particular linker. 

1 5 The cleavable linkers may be selected from a large plethora of chemical 

structures. Examples of linkers include, but are not limited to, linkers having 
an enzymatic cleavage site, linkers comprising a chemical degradable 
component, and linkers cleavable by electromagnetic radiation, such as light. 

20 Examples of linkers cleavable by electromagnetic radiation (light) 
o-nitrobenzyl p-alkoxy 




,2 



25 



o-nitrobenzyl in exo position 
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For more details see Holmes CP. J. Org. Chem. 1997, 62, 2370-2380 



3-nitrophenyloxy 

— V 9 ° 

j OH OH 
0 2 N h 

For more details see Rajasekharan Pillai, V. N. Synthesis. 1980, 1-26 

Dansyl derivatives: 
R 2 




For more details see Rajasekharan Pillai, V. N. Synthesis. 1980, 1-26 

Coumarin derivatives 

,NR 2 R 3 

hv H— NR 2 R 3 




FrI-CT^^O^O H-Donor 

For more details see R. O. Schoenleber, B. Giese. Synlett 2003, 501-504 

R 1 and R 2 can be either of the potential drug candidate and the identifier 
oligonucleotide, respectively. Alternatively, R 1 and R 2 can be either of the 
target or a solid support, respectively. 
R 3 = Hor OCH 3 

If X is O then the product will be a carboxylic acid 
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If X is NH the product will be a carboxamide 



One specific example is the PC Spacer Phosphoramidite (Glen research 
catalog # 10-4913-90) which can be introduced in an oligonucleotide during 
synthesis and cleaved by subjecting the sample in water to UV light (~ 300- 
350 nm) for 30 seconds to 1 minute. 



The above PC spacer phosphoamidite is suitable incorporated in a library of 
complexes at a position between the indentifier and the potential drug 
candidate. The spacer may be cleaved according to the following reaction. 



R 1 and R 2 can be either of the encoded molecule and the identifying 
molecule, respectively. In a preferred aspect R 2 is an oligonucleotide 
identifier and the R 1 is the potential drug candidate. When the linker is 
25 cleaved a phosphate group is generated allowing for further biological 

reactions. As an example, the phosphate group may be positioned in the 
5'end of an oligonucleotide allowing for an enzymatic ligation process to take 




-0-P-N(iPr) 2 

6-CNEt 
N0 2 



DMT = 4,4'-Dimethoxytrityl 
iPr = Isopropyl 
CNEt = Cyanoethyl 




o 

HO-P-O-R 2 
OH 



place. 



WO 2005/078122 

43 

Examples of linkers cleavable by chemical agents: 
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Ester linkers can be cleaved by nucleophilic attack using e.g. hydroxide ions. 
In practice this can be accomplished by subjecting the target-ligand complex 
to a base for a short period. 
R 3 R 4 




OH- 



O 

■Ok 



OH 



R 3 R 4 




R 1 and R 2 can be the either of be the potential drug candidate or the identifier 
10 oligonucleotide, respectively. R 4 " 6 can be any of the following: H, CN, F, N0 2 , 
S0 2 NR 2 . 

Disulfide linkers can efficiently be cleaved / reduced by Tris (2-carboxyethyl) 
phosphine (TCEP). TCEP selectively and completely reduces even the most 

1 5 stable water-soluble alkyl disulfides over a wide pH range. These reductions 
frequently required less than 5 minutes at room temperature. TCEP is a non- 
volatile and odorless reductant and unlike most other reducing agents, it is 
resistant to air oxidation. Trialkylphosphines such as TCEP are stable in 
aqueous solution, selectively reduce disulfide bonds, and are essentially 

20 unreactive toward other functional groups commonly found in proteins. 



25 



R^S-S-R 2 + 




+ H 2 0 



R'-SH + HS-R 2 + 1 / — ^ 



,OH 



More details on the reduction of disulfide bonds can be found in Kirley, 
T.L(1989), Reduction and fluorescent labeling of cyst(e)ine-containing 
proteins for subsequent structural analysis, Anal. Biochem. 180, 231 and 
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Levison, M.E., ef a/. (1969), Reduction of biological substances by water- 
soluble phosphines: Gamma-globulin. Experentia 25, 126-127. 

Linkers cleavable by enzymes 

5 The linker connecting the potential drug candidate with the identifier 

oligonucleotide or the solid support and the target can include a peptide 
region that allows a specific cleavage using a protease. This is a well-known 
strategy in molecular biology. Site-specific proteases and their cognate target 
amino acid sequences are often used to remove the fusion protein tags that 

1 0 facilitate enhanced expression, solubility, secretion or purification of the 

fusion protein. 1 J 

Various proteases can be used to accomplish a specific cleavage. The 
specificity is especially important when the cleavage site is presented 
together with other sequences such as for example the fusion proteins. 

1 5 Various conditions have been optimized in order to enhance the cleavage 
efficiency and control the specificity. These conditions are available and 
know in the art. 

Enterokinase is one example of an enzyme (serine protease) that cut a 
20 specific amino acid sequence. Enterokinase recognition site is Asp-Asp-Asp- 
Asp-Lys (DDDDK), and it cleaves C-terminally of Lys. Purified recombinant 
Enterokinase is commercially available and is highly active over wide ranges ( ; 
in pH (pH 4.5-9.5) and temperature (4-45°C). 

25 The nuclear inclusion protease from tobacco etch virus (TEV) is another 

commercially available and well-characterized proteases that can be used to 
cut at a specific amino acid sequence. TEV protease cleaves the sequence 
Glu-Asn-Leu-Tyr-Phe-Gln-Gly/Ser (ENLYFQG/S) between Gln-Gly or Gln- 
Ser with high specificity. 

30 

Another well-known protease is thrombin that specifically cleaves the 
sequence Leu-Val-Pro-Arg-Gly-Ser (LVPAGS) between Arg-Gly. Thrombin 
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has also been used for cleavage of recombinant fusion proteins. Other 
sequences can also be used for thrombin cleavage; these sequences are 
more or less specific and more or less efficiently cleaved by thrombin. 
Thrombin is a highly active protease and various reaction conditions are 
5 known to the public. 

Activated coagulation factor FX (FXa) is also known to be a specific and 
useful protease. This enzyme cleaves C-terminal of Arg at the sequence lle- 
Glu-Gly-Arg (IEGR). FXa is frequently used to cut between fusion proteins 
1 0 when producing proteins with recombinant technology. Other recognition 
sequences can also be used for FXa. 

Other types of proteolytic enzymes can also be used that recognize specific 
amino acid sequences. In addition, proteolytic enzymes that cleave amino 
1 5 acid sequences in an un-specific manner can also be used if only the linker 
contains an amino acid sequence in the complex molecule. 

Other type of molecules such as ribozymes, catalytically active antibodies, or 
lipases can also be used. The only prerequisite is that the catalytically active 
20 molecule can cleave the specific structure used as the linker, or as a part of 
the linker, that connects the encoding region and the displayed molecule or, 
in the alternative the solid support and the target. 

A variety of endonucleases are available that recognize and cleave a double 
25 stranded nucleic acid having a specific sequence of nucleotides. The 

endonuclease Eco Rl is an example of a nuclease that efficiently cuts a 
nucleotide sequence linker comprising the sequence GAATTC also when this 
sequence is close to the nucleotide sequence length. Purified recombinant 
Eco Rl is commercially available and is highly active in a range of buffer 
30 conditions. As an example the Eco Rl is working in in various protocols as 
indicted below (NEBuffer is available from New England Biolabs): 
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NEBuffer 1 : [10 mM Bis Tris Propane-HC!, 10 mM MgCI2, 1 mM dithiothreitol 
(pH 7.0 at 25°C)], 

NEBuffer 2 : [50 mM NaCI, 10 mM Tris-HCl, 10 mM MgCI2, 1 mM 
dithiothreitol (pH 7.9 at25°C)], 
5 NEBuffer 3 : [100 mM NaCI, 50 mM Tris-HCl, 10 mM MgCI2, 1 mM 
dithiothreitol (pH 7.9 at25°C)], 

NEBuffer 4 : [50 mM potassium acetate, 20 mM Tris-acetate, 10 mM 
magnesium acetate, 1 mM dithiothreitol (pH 7.9 at 25°C)]. 
Extension buffer : mM KCl, 20 mM Tris-HCI(Ph 8.8 at 25o C), 10 mM (NH4 )2 
10 S04 , 2 mM MgSO 4 and 0.1 % Triton X-100, and 200 uM dNTPs. 



Formation of a mixture of homo- and hetero-duplexes 

After the identifier oligonucleotides of complexes comprising display 

molecules interacting in a certain desired fashion with a target are 

1 5 partitioned, a nucleic acid amplification is usually conducted. The nucleic acid 
amplification method may be PCR or a method aquivalent thereto. The 
amplification is preferably conducted such that the relative destribution of the 
individual identifier oligonucleotides are retained. According to a certain 
embodiment, identical PCR priming sites to obtain a proportional 

20 amplification of the identifier oligonucleotides surround the coding section. 

After the formation of duplexes, a denaturing step usually follows. A 
denaturing can be obtained by a variety of methods, including increased 
temperature, high salt concentration, presence of organic solvents, certain 
25 duplex disrupting chemicals, etc. Generally it is preferred to use an elevated 
temperature. When the temperature is increased above the melting 
temperature of the duplex the single stranded oligonucleotides are formed. At 
the elevated temperature, single stranded oligonucleotides complementing 
the partitioned identifier oligonucleotides optionally may be added. 



A mixture of homo- and hetero-duplexes is formed by subjecting the single 
stranded mixture to hybridisation conditions. When the denaturing is obtained 
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by elevating the temperature, the hybridisation conditions may appropriately 
be obtained by decreasing the temperature below the melting temperature of 
the homo-duplex. The temperature decrease rate may be adjusted according 
to the specific conditions used to obtain the optimal condition for the 

5 formation of the homo-duplexes. If a high temperature decrease rate is 

selected a lower tendency of homo-duplex formation is obtained. Conversely, 
a low temperature decrease rate implies a less tendency of hetero-duplex 
formation. When the denaturing is obtained using hybridisation modifying 
agents, like salt, solvents etc, hybridisation conditions may be obtained by 

10 desalination in case of salt and evaporation in case of solvents. 

As mentioned above, the mixture of denatured single stranded 
oligonucleotides may be added complementing oligonucleotides to obtain 
certain desired effects. In an aspect of the invention random oligonucleotides 

1 5 are added to the single stranded oligonucleotides to lower the general 

tendency of duplex formation. A less tendency may be desirable because the 
selectivity for the best binding display molecule increases. In another aspect 
of the invention, certain specific complementing oligonucleotides are added 
to increase the probability of duplex formation for certain kind of identifier 

20 oligonucleotides. Alternatively, the addition of certain complementing 

oligonucleotides may compensate for an initial library not having an even 
distribution of the concentration of individual members. 

In certain applications of the invention it is preferred to use amplification of 
25 only one strand, thereby to produce single stranded identifier 

oligonucleotides. The single stranded identifier oligonucleotides may then be 
mixed with a mixture of oligonucleotides complementing the original 
oligonucleotides of the library. The more frequent abundant single stranded 
identifier oligonucleotides will be more inclined to form a higher portion of 
30 homo-duplex compared to the less frequently occurring, which will tend to 
produce a higher portion of hetero-duplex. It may be desired to spike the 
partitioned fraction of single stranded identifier oligonucleotides with a higher 
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concentration of certain complementing oligonucleotides in order to bias the 
formation of homo- and hetero-duplexes. Alternatively, certain 
complementing oligonucleotides may be added in a lower amount. As an 
example, the identifier oligonucleotide coding for a known ligand may be 
5 extinguished because it is desired to find unknown ligands, by avoiding the 
addition of an oligonucleotide complementing said identifier oligonucleotide. 

In some aspects of the invention, a library of different display molecules, 
each being associated with an identifying oligonucleotide, is divided into two 

10 or more portions. A first portion may be contacted with a target and the 

identifier oligonucleotides of successful display molecules harvested, while a 
second portion may be contacted with a blank vessel or a second target and 
the identifier oligonucleotides of successful display molecules of the second 
portion collected. Prior to the hetero- and homo-duplex formation, the 

1 5 identifier oligonucleotides from the two portions are mixed. The advantages 
of screening two or more portions of a library individually include changing 
the profile of the background, obtaining subtype selectivity of the display 
molecule etc. It is also possible to combine two or more libraries before the 
contacting with the target to obtain an altered profile of binding display 

20 molecules. 

Recovery of homo-duplexes 

The recovery of homo-duplex is typically obtained by removing the hetero- 
duplexes form the mixture of hetero-duplexes and homo-duplexes. In another 
25 aspect of the invention, the homo-duplexes are extracted form the mixture. 
When the hetero-duplexes are removed from the reaction mixture, any 
convenient method can be used, such as a mis-match binding, enzymatic or 
chemical mis-match cleavage, or physical method. 

30 Mis-match binding 

In a certain aspect of the invention, hetero-duplexes are removed by binding 
to prokaryotic or eukaryotic mis-match binding proteins. An example is MutS, 
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a mismatch binding protein isolated from E. coii, which recognisesj-egions of 
double-stranded DNA containing a mismatched base pair (Wagner el al., 
1995, Nucleic Acids Research, 22, 1541-1547) as well as 1 to 4 base pair 
insertion-deletion loops. MutS is allowed to bind to the hetero-duplexes and 

5 bound hetero-duplex/MutS complexes are removed from the reaction mixture 
using, for example, powdered nitrocellulose. A convenient alternative is to 
use MutS conjugated to magnetic beads, allowing bound heteroduplexes to 
be removed from the reaction mixture with a magnet. MutS may also be 
conjugated to biotin and the bound hetero-duplexes removed from the 

10 mixture using streptavidin-coated beads. 

MutY has considerable potential for mismatch detection, as its in vivo 
function is to repair mismatched G:A base pairing by cleavage of the 
adenine-containing strand. Similar proteins thought to be involved in G:T and 
15 G:U mismatch repair have also been described. Hsu (Carcinogenesis 

1994;15:1657-62) described the use of E. coii MutY protein for the detection 
of mismatched G:A in p53. As low as 1-2% mutant DNA in a sample of 
mutant and wild-type DNA could be detected using a synthetic DNA 
oligonucleotide to create G/A mis-pairing. 

20 

Enzymatic cleavage 

Chemical and enzymatical cleavage methods used for degradation of hetero- 
duplexes must differentially cleave these sequences and retain the homo- 
duplexes. Any sequence difference will result in the formation of a mispairing, 

25 causing localized distortion of the double helix. Cleavage techniques exploit 
this structural change by selectively degrading or modifying DNA at the site 
of the mismatch. Ideally, little or no cleavage would be seen in a perfectly 
matched DNA fragment, and all distortions of the helix generated by base 
mismatches would result in cleavage. In practice neither criteria a fully met, 

30 and the utility of a technique becomes a trade-off between ease of use, 
sensitivity and specificity. 
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In certain aspect of the invention mammalian or bacterial endonucleases are 
used to recognise and cleave the hetero-duplexes at mismatched 
nucleobases (see U.S. Pat. No. 5,824,471). Examples of preferred enzymes 
include bacteriophage resolvases such as T4 endonuclease VII orT7 
5 endonuclease L In a preferred aspect of the invention, thermostable cleavage 
enzymes would be used in order to avoid the necessity of adding fresh 
enzyme during each round of heteroduplex formation and removal. 

An enzyme called "Cleavase" from Third Wave Technologies relies on the 
1 0 endonucleoiytic cleavage of stem-loop structures. The precise nature of the 
thermostable enzyme "Cleavase" has not been published. Since the stem- 
loop profile is dependent upon the primary sequence of the DNA, sequence 
changes in some cases result in a change in the cleavage profile. This 
method is unique in that unlike all other enzymatic techniques, it does not 
1 5 require the formation of a mismatch heteroduplex to generate a site for 
enzymatic cleavage. 

Ribonuclease A cleavage was originally described by Myers et al (Science 
1985; 230:1242-6) using DNA:RNA hybrids. Sensitivity was reported to be 

20 around 60% per strand cleaved. Grange et al (Nucleic Acids Research 1990; 
18:4227-36) described improved sensitivity by screening both strands of 
RNA. A Non-lsotopic RNase Cleavage Assay (NIRCA) has also been 
described. This assay (commercially available from Ambion) utilized PCR 
primers with phage RNA polymerase promoters so that large quantities of 

25 RNA were produced. Cleaved products could be detected on agarose gels 

and fragments of up to 1 kilobase were analyzed. Additional RNase enzymes 
such as RNase 1 and RNase T1 increase the sensitivity of the assay. The 
commercial kit includes a helix modifying reagent that makes the mismatches 
more sensitive to cleavage. 

30 

T4 endonuclease VII (T4E7) and T7E1 are small proteins from 
bacteriophages that bind as homodimers and cleave aberrant DNA structures 
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including Holliday Junctions (and are hence sometimes called "resolvases") 
though it is far from clear that they perform such a role in vivo. Mashal et al 
(Nature Genetics 1995; 9:177-83) observed that they preferentially cleaved 
mismatched hetero-duplexes, leading to the possibility of an enzymatic 

5 equivalent to the chemical cleavage of mismatch. DNA requires no special 
preparation after amplification like GC clamping or including primers with 
'phage promoters. Background peaks which are seen are highly reproducible 
and may therefore amenable to background subtraction algorithms like those 
applied to DNA sequencing traces. A commercially available T4E7-based 

1 0 mutation detection kit is available from Amersham-Pharmacia. 



A plant endonuclease (CEL I) with simitar activity has also been described. 
CEL I is one of series of plant endonucleases with similar activity to nuclease 
S1 but at neutral pH instead of pH 4 or 5. Like T4E7, the cleavage efficiency 
1 5 varies according to the mismatch examined and background cleavage is 
dependent on the template being examined. 

Uracil glycosylase and photo-activated guanine modification reagent have 
been used to develop a cleavage method that essentially produces T or G 
20 sequencing tracks. DNA synthesis by PCR requires the incorporation of a 
proportion of uracil bases in place of thymines. These can be removed by 
uracil glycosylase and the abasic site then cleaved by heat or enzymatic 
treatment. 

25 The present invention also includes any combination of enzymes. 
Chemical cleavage 

Chemical cleavage of mismatches was developed as a modification of the 
Maxam-Gilbert DNA sequencing method by Cotton et al (Proc Natl Acad Sci 
30 (US) 1988;85: 4397-401). Mismatched thymines are susceptible to 

modification by osmium tetroxide (or potassium permanganate and tetraethyl 
ammonium acetate) and mismatched cytosines can be modified by 
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hydroxylamine. The modified bases are then cleaved by hot piperidine 
treatment. 



Physical method 

5 The physical method of separating the heterc-duplex and homo-duplex 
molecules involves physical separation, such as achieved by 
chromatography or electrophoresis. Suitable chromatography methods 
include column chromatography, affinity chromatography, size-exclusion 
chromatography and gel chromatography. Suitable gels for gel 

1 o chromatography include non-denaturing gels, such as agarose or 

polyacrylamide gels. The chromatography can be performed at elevated 
temperatures, e.g. above ambient temperatures, to favour the formation of 
homo-duplexes. In a certain aspect of the invention, the temperature is 
selected between the average melting temperature of the homo-duplexes 

1 5 and the melting temperature of the hetero-duplexes having the least amount 
of mis-matches. Generally, a temperature for performing gel chromatography 
is selected in the range of 45 to 80 degrees Celsius. 

Suitable physical methods notably include denaturing high performance liquid 
20 chromatography (DHPLC) and chemical or temperature denaturing 

electrophoresis. Denaturing HPLC is a chromatographic technique capable of 
separating hetero-duplex and homo-duplex DNA molecules in a mixture. The 
mixture is applied to a stationary reverse-phase support and the homo- and 
hetero-duplex molecules are eluted (under thermal or chemical conditions 

25 capable of partially denaturing hetero-duplexes) with a mobile phase 

containing an ion-pairing reagent (e.g. triethylammonium acetate; TEAA) and 
an organic solvent (e.g. acetonitrile; AcN). DHPLC can also allow the direct 
quantisation of relative homo-duplex and hetero-duplex concentrations by the 
detection of ultraviolet absorbance or fluorescent emission of/from the 

30 separated species. The area under the absorbance/emission peak is 

proportional to the amount of product, which therefore allows quantitative 
assessment of the relative proportions. DHPLC is described in Liu W et al. 
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(Nucleic Acids Research. 26:1396-1400, 1998 and O'Donovan MC et al. 
Genomics. 52:4449, 1998). 

A. preferred method for use in the instant invention to separate hetero-duplex 
5 and homo-duplex molecules is as described in U.S. Pat. No. 5,795,976, 
which is incorporated herein by reference. 

Determining the identifier oligonucleotide sequence 

The nucleotide sequence of the identifier sequence is determined to identify 

10 the identity of the binding display molecule(s). In a certain embodiment of the 
invention, chemical entities that participated in the formation of the display 
molecules that binds to the target are identified. The synthesis method of the 
display molecule may be established if information on the chemical entities 
as well as the point in time they have been incorporated in the display 

1 5 molecule can be deduced from the identifier oligonucleotide. It may be 
sufficient to obtain information on the chemical structure of the various 
chemical entities that have participated in the formation of the display 
molecule to deduce the full molecule due to structural constraints during the 
formation. As an example, the use of different kinds of attachment 

20 chemistries may ensure that a chemical entity can only be reacted at a single 
position on a scaffold. Another kind of chemical constrains may be present 
due to steric hindrance on the scaffold molecule or the chemical entity to be 
transferred, in general however, it is preferred that information can be 
inferred from the identifier oligonucleotide sequence that enable the 

25 identification of each of the chemical entities that have participated in the 
formation of the encoded molecule along with the point in time in the 
synthesis history the chemical entities have been incorporated in the 
(nascent) display molecule. 

30 Although conventional DNA sequencing methods are readily available and 
useful for this determination, the amount and quality of isolated bifunctional 
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molecules may require additional manipulations prior to a sequencing 
reaction. 

Where the amount is low, it is preferred to increase the amount of the 
5 oligonucleotide sequences by polymerase chain reaction (PCR) using PCR 
primers directed to primer binding sites present in the identifier 
oligonucleotide sequence. 

In one embodiment, the different identifier oligonucleotide sequences are 
10 cloned into separate sequencing vectors prior to determining their sequence 
by DNA sequencing methods. This is typically accomplished by amplifying 
the different identifier oligonucleotide sequences by PCR and then using a 
unique restriction endonuclease sites on the amplified product to directionally 
clone the amplified fragments into sequencing vectors. The cloning and 
15 sequencing of the amplified fragments then is a routine procedure that can 
be carried out by any of a number of molecular biological methods known in 
the art. 

Alternatively, the bifunctional complex or the PCR amplified identifier 
20 oligonucleotide sequence can be analysed in a microarray. The array may be 
designed to analyse the presence of a single codon or multiple codons in a 
identifier oligonucleotide sequence. 



Another approach, the identifier oligonucleotide product is analysed by 
25 QPCR. Preferably, the QPCR affords information as to the chemical moieties 
that has participated in the formation of the display molecules. The QPCR 
approach also allows a direct investigation of the enrichment factor if two 
samples are analysed in parallel, said samples being collected before and 
after the use of the present method. Various conditions can be investigated 
30 to obtain the most optimal selection procedure before the sequences are 
analysed to identify the precise structures of the binding molecules. 
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A still further method for decoding the identifier oligonucleotides comprises 
high throughput sequencing, such as the Pyrosequencing method of 454 
Lifesciences. 

5 According to the invention, the homo-duplexes are used to deduce the 
identity of the display molecule(s) interacting with a target. The 
homoduplexes may be used directly from the recovery step or modified by 
any biotechnological technique prior to decoding. Notably, the modification 
can include total or partial amplification of the homo-duplexes to produce a 
10 double stranded or single stranded product. Also the modification may 
include fragmentation, e.g. digestion by a restriction enzyme or another 
nucleic acid active enzyme. In a certain embodiment, a restriction site is 
positioned between codons to allow for a separation of codons of the 
identifier oligonucleotides. The fragmentation may facilitate the subsequent 

15 decoding, as small nucleic acids usually are easier to decode. 

The recovered duplexes of step e may one or more times be recycled to step 
d. The recycling may reduce the diversity at the nucleic acid level and at the 
same time increase the probability that an identifier oligonucleotide from a 

20 display molecule interacting with a target is identified when sequencing a 

limited number of homo-duplexes. Usually, when two or more sequences of a 
single identifier are detected during the sequencing, this is an indication that 
a display molecule performing an interaction with the target is identified. The 
repetition of the hetero- and homo-duplexes formation and recovery of the 

25 homo-duplexes may be conducted a suitable number of times until two or 
more identifier oligonucleotides in a sequencing step of a limited number of 
homo-duplexes is revealed. 

In a certain embodiment of the invention, therefore, a decoding of the homo- 
30 duplex is effected prior to a consecutive repetition. The information of the 
decoding step may be used to modify the composition of the identifier 
oligonucleotides and strands complementary thereto. Notably, the 
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composition can be modified by removing certain identifier oligonucleotides 
from the pool. As an example, identifiers of known display molecules 
interacting with a target can be removed to reveal other display molecules in 
the library having an ability to perform the same interaction albeit, to a lesser 
5 extend. 

Various methods for excluding certain identifier oligonucleotides are 
available, including removal by an immobilized probe, digestion with a 
sequence specific nuclease etc. An appealing method includes that the 

1 0 partitioned fraction of identifier oligonucleotides is split in two portions. A first 
portion is then treated as described above to recover homo-duplexes. The 
recovered homo-duplexes are used to treat the second portion. In a certain 
aspect of the invention it is preferred to amplify the recovered homo-duplexes 
using primers conjugated to biotin. When the amplification product containing 

15 biotin is mixed with the second portion of the partitioned identifiers under 

denaturing conditions and subsequently subject to at least partly renaturing 
conditions, the strands with attached biotin may anneal with a complement in 
the mixture. A subsequent treatment with streptavidin or avidin allows the 
biotin labelled duplexes selectively to be removed from the mixture. 

20 

Preferred embodiments of the present invention 

The present invention relates to a method for identifying a display molecule of a 
Afunctional complex further comprising an identifier oligonucleotide linked to the 
25 display molecule, said method comprising the steps of 

i) separating the complementary single stranded identifier oligonucleotides from an 
enriched and/or partitioned fraction of Afunctional complexes having an affinity for a 
target molecule, said bifunctional complexes comprising a duplex identifier 

30 oligonucleotide comprising said complementary single stranded identifier 
oligonucleotides, 

ii) hybridising the single stranded identifier oligonucleotides obtained in step i), 
thereby generating a composition comprising homo-duplex identifier 
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oligonucleotides and hetero-duplex identifier oligonucleotides, wherein homo-duplex 
identifier oligonucleotides hybridise without generating any mis-matches between 
the complementary oligonucleotide strands, and wherein one or more mis-matches 
are present when complementary identifier oligonucleotides form hetero-duplexes, 
5 iii) separating homo-duplexes from hetero-duplexes, 

iv) obtaining a fraction comprising a higher percentage of homoduplexes than in the 
fraction used in step i), such as obtaining a fraction comprising predominantly homo- 
duplexes, 

v) decoding the identifier oligonucleotides of the homo-duplexes, and 

10 vi) identifying based on the decoding in step v) the one or more display molecules 
identifier by the decoded identifier oligonucleotides. 

Said enriched and/or partitioned fraction of bifunctionai complexes is preferably 
obtained from a naive "starting library" which has been through at least one 
15 partitioning and/or selecting step, such as 2, 3, 4, 5, 6 or 7 partitioning and/or 

selecting steps, such as 4 partitioning and/or selecting steps, to form the enriched 
and/or partitioned fraction of bifunctionai complexes. 

Steps i) to iii) described above may be repeated one or more times to further enrich 
20 the enriched and partitioned fraction. 

After step iv), an optional step may be added in which the identifier nucleotides from 
the fraction obtained in step iv) are (optionally after amplification) used to contact 
either the fraction used in step i) (or a partitioned variant thereof), and/or the original 

25 naive library used to generate the fraction used in step i) (or a partitioned variant 
thereof). Those bifunctionai molecules from these earlier libraries identified as 
having affinity to the nucleotide tag of the fraction obtained in step iv) (or to the 
complementary sequence of the nucleotide tag) may then be used to enrich the 
library obtained in step iv). This optional step may be repeated as many times as is 

30 required to obtain the required enrichment of the library, optionally after repeating 
steps i)-iii) again. 

Preferably, the duplex identifier oligonucleotide comprising complementary single 
stranded identifier oligonucleotides is obtained by hybridisation of a complementary 
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identifier oligonucleotide with the identifier oligonucleotide linked to the display 
molecule of the bifunctional complex. 

Equally preferably, the duplex identifier oligonucleotide comprising complementary 
single stranded identifier oligonucleotides is obtained by hybridising a probe 
5 oligonucleotide with the identifier oligonucleotide of the bifunctional complex, 
enzymatically extending said probe oligonucleotide, thereby obtaining a 
complementary identifier oligonucleotide hybridised with the identifier 
oligonucleotide linked to the display of the bifunctional complex. 

1 0 The duplex identifier oligonucleotide comprising complementary single stranded 
identifier oligonucleotides may be obtained prior to, during, concomitantly with or 
after the step of partitioning the bifunctional complexes. 

Preferably the identifier oligonucleotide linked to the display molecule is selected 
1 5 from the group consisting of a) a single stranded identifier oligonucleotide and b) a 
duplex identifier oligonucleotide comprising complementary single stranded identifier 
oligonucleotides, said method comprising the preliminary steps (i.e. preferably prior 
to step (i), described above) of 

20 a) providing a composition of bifunctional complexes comprising different display 
molecules, 

b) providing one or more target molecule(s) having an affinity for one or more of said 
display molecules (wherein said one or more target molecules may be different 
and/or the same), 

25 c)contacting the composition of bifunctional complexes provided in step i) with the 
target molecule provided in step ii) 

d) obtaining an enriched fraction of bifunctional complexes comprising one or more 
display molecules having an affinity for said target molecule, 

e) partitioning said enriched fraction obtained in step iv) from bifunctional complexes 
30 not having an affinity for said target molecule, and 

f) complementing single stranded identifier oligonucleotides of different bifunctional 
complexes in which the display molecule is linked to a) a single stranded identifier 
oligonucleotide, thereby obtaining a duplex identifier oligonucleotide comprising 
complementary identifier oligonucleotides, with the proviso that no single stranded 
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complementation occurs for bifunctional complexes comprising b) duplex identifier 
oligonucleotides comprising complementary identifier oligonucleotides. 

After step d), an optional step may be added in which the identifier nucleotides from 
5 the fraction obtained in step d) are (optionally after amplification) used to contact 
either the composition used in step a) (or a partitioned variant thereof), and/or the 
original naTve library used to generate the composition used in step i) (or a 
partitioned variant thereof). Those bifunctional molecules from these earlier libraries 
identified as having affinity to the nucleotide tag of the fraction obtained in step d) (or 
10 to the complementary sequence of the nucleotide tag) may then be used to enrich 
the library obtained in step d). This optional step may be repeated as many times as 
is required to obtain the required enrichment of the library, optionally after repeating 
steps a)-d) again. 

1 5 The display molecule of a bifunctional complex provided in step a) or obtained in 
step d) may be linked to a single stranded identifier oligonucleotide, wherein said 
method further comprises the step of complementing said single stranded identifier 
oligonucleotides of the different bifunctional complexes, thereby obtaining a duplex 
identifier oligonucleotide comprising complementary identifier oligonucleotides. Said 

20 step of complementing said single stranded identifier oligonucleotide may occur 
prior to, during or concomitantly with, or after partitioning of the bifunctional 
complexes. The complementing step may comprise the steps of hybridising one or 
more oligonucleotide probe(s) to the single stranded identifier oligonucleotide of at 
least some of the bifunctional complexes, and ligating and/or extending 

25 enzymatically said probe(s), thereby generating a duplex identifier oligonucleotide 
comprising complementary oligonucleotide strands. 

Also, the complementation of the single stranded identifier oligonucleotides may 
comprise the step of hybridising a complementary identifier oligonucleotide to the 
single stranded identifier oligonucleotide of at least some of the bifunctional 
30 complexes. 

In one embodiment, the display molecule of at least some of the bifunctional 
complexes are linked to duplex identifier oligonucleotides comprising 
complementary oligonucleotide strands. 
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In one embodiment of the method disclosed herein, hetero-duplexes are selectively 
removed or eliminated from the composition comprising homo-duplex identifier 
oligonucleotides and hetero-duplex identifier oligonucleotides. This may e.g. be 
earned out by enzymatically degrading the mis-matched, single stranded part of the 
5 hetero-duplexes. The enzyme used may e.g. comprise nuclease activity or consist 
of a polypeptide having nuclease activity, for example an enzyme selected from T4 
endonuclease VII, T4 endonuclease I, CEL I or nuclease S1, including variants and 
combinations thereof. Said enzyme is preferably thermostable. 

1 0 In one embodiment of the present invention, the chemical entities are precursor 
chemical entities which are processed in the synthesis of the display molecule, 
wherein the processed chemical entities form part of the display molecule. 

in one embodiment of the present invention, each codon comprises 4 or more 
15 nucleotides. 

The chemical entity reactions may occur without the intervention of an enzyme. 

In one preferred embodiment, the linker linking the display molecule and the 
20 identifier oligonucleotide is selectively cleavable. For example, the linker may be 

cleaved by irradiation. The cleavage preferably occurs after the bifunctional complex 
has contacted the target. 

The composition of bifunctional complexes may comprise more than 1000 different 
25 display molecules. 

In one embodiment of the present invention, the target molecule is of a biological 
origin. The molecular target may be immobilized on a solid support, for example the 
immobilized target may form a stable or quasi-stable dispersion. The target 

30 molecule may comprise or consist of a polypeptide, such as selected from the group 
consisting of kinases, proteases, phosphatases, and antibodies. In another 
embodiment of the present invention, the target molecule may comprise or consist 
of a nucleic acid, such as consisting of a nucleic acid. Said nucleic acid may e.g. be 
a DNA or RNA aptamer. In one preferred embodiment of the present invention, the 

35 target molecule, such as a polypeptide, is associated with the nucleic acid template 
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having templated the synthesis thereof. Further suitable target molecules for use in 
the present invention include, but are not limited to : 

(1) Receptors, e.g.,andrenergic receptors, histamine receptors, dopamine receptors, 
alpha-V-beta-3 integrin 

(2) Kinases, e.g., ROCK-1, CaM kinase, cdk5, MAPKAP kinase, PKC gamma, p38, 
PRAK, mTOR, Raf, WNK3 

(3) Polymerases, e.g., HIV reverse transcriptase, 

(4) Transcription factors, e.g., BRCA1, p53, 

(5) Proteases, e.g., HIV protease, thrombin, 

(6) Phosphatases, e.g., PTP 1B, Cdc25A, (7) Cytokines, e.g., TNF-alpha, 

(8) Antibodies, e.g., antibodies raised against any of the mentioned targets or other 
macromolecules. 

Furthermore, suitable targets may have two or more of the above functions 

in one preferred embodiment, the contacting of the target molecule and the 
composition of different bifunctional complexes is achieved by mixing the target 
molecule with the composition of different bifunctional coplexes. In such cases, the 
target is optionally saturated with a known ligand prior to contacting the different 
bifunctional complexes. 

In one embodiment of the present invention, at least one of the complementary 
identifier oligonucleotides of the recovered homo-dupiexes is amplified prior to the 
step of decoding the identity of the display molecule. 

The partitioned fraction of identifier oligonucleotides may e.g. be amplified by PCR, 
such as being proportionally amplified. 

In one embodiment of the present invention, the complementary identifier 
oligonucleotide strands of homo-duplexes and hetero-duplexes in the fraction 
comprising predominantly homo-duplexes are separated and allowed to hybridise, 
said hybridisation resulting in a further fraction comprising predominantly homo- 
duplexes. Furthermore, the complementary identifier oligonucleotide- strands of 
homo-duplexes and hetero-duplexes in the fraction comprising predominantly homo- 
duplexes may optionally be amplified before being re-hybridised and/or the identifier 
oligonucleotides may be decoded before being re-hybridised. Said decoding may be 
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used to eliminate a subset of identifier oligonucleotides before the step of re- 
hybridisation. 

Preferred methods for obtaining the display molecule useful in the inventive 
5 methods described herein 

Any suitable method may be used for generating the display molecule useful in the 
inventive methods described herein. In said method for generating the display 
molecules for use in the present invention, it is preferred that at least 3 chemical 

1 0 entities, such as at least 4 chemical entities, for example at least 5 chemical entities, 
such as at least 6 chemical entities, for example at least 7 chemical entities, such as 
at least 8 chemical entities, for example at least 9 chemical entities, such as at least 
10 chemical entities, for example at least 11 chemical entities, such as at least 12 
.chemical entities, for example at least 13 chemical entities, such as at least 14 

1 5 chemical entities, for example at least 15 chemical entities, such as at least 16 
chemical entities, for example at least 17 chemical entities, such as at least 18 
chemical entities, for example at least 19 chemical entities, such as at least 20 
chemical entities, are reacted. 

20 In one embodiment of the method disclosed herein, the display molecule is obtained 
by reacting two or more chemical entities, wherein each chemical entity is preferably 
linked to a building block comprising one or more anti-codons, wherein said anti- 
codon(s) is complementary to an identifier oligonucleotide codon(s), wherein said 
codons or anti-codons identifies the chemical entities having reacted to generate the 

25 display molecule. Said codons may be separated by a framing sequence. 

Templated molecules and methods for using such molecules 
The display molecule useful in the inventive method described herein may in one 
embodiment be obtained by a method comprising the steps of 
30 i) providing at least one template comprising a sequence of n coding elements, 

wherein each coding element comprises at least one recognition group capable of 
recognising a predetermined complementing element, and 



35 



wherein n is an integer of more than 1, 
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ii) providing a plurality of building blocks, wherein each building block comprises 

a) at least one complementing element comprising at least one recognition group 
5 capable of recognising a predetermined coding element, 

b) at least one functional entity comprising at least one functional group and at least 
one reactive group, and 

10 c) at least one linker separating the at least one functional entity from the at least 
one complementing element, 

iii) contacting each of said coding elements with a complementing element 
capable of recognising said coding element, 

15 

iv) optionally obtaining a complementing template by covalently 

linking the complementing elements, and 

v) obtaining a display molecule by reacting functional groups 
20 on different building blocks, 

wherein the display molecule is linked to a building block or to the complementing 
template, when formed, or to the template that templated the synthesis of the 
display molecule. Further embodiments of this method that are suitable for use in 
25 the present invention are described in PCT application with publication number WO 
02/103008 ("Templated Molecules and Methods for Using such Molecules"), the 
contents of which are incorporated herein by reference. 

Ligational encoding of Small Molecules 

30 

In another embodiment of the present invention, said display molecule may be 
obtained by a method comprising the steps of 
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i) providing at least one template comprising one or more codons capable of 
hybridising to an anti-codon, wherein said template is optionally associated with one 
or more chemical entities, and 

ii) providing a plurality of building blocks each comprising an anti-codon associated 
5 with one or more chemical entities, 

iii) hybridising the anti-codon of one or more of the provided building blocks to the 
template, 

iv) covalently linking said anti-codons and/or linking the at least one template with 
the anti-codon of at least one building block, thereby generating an identifier 

1 0 oligonucleotide capable of identifying chemical entities having participated in the 
synthesis of the display molecule, 

v) displacing or separating the template from one or more of the anti-codons 
hybridised thereto, thereby generating an at least partly single stranded identifier 
oligonucleotide associated with a plurality of chemical entities, 

1 5 vi) generating a bifunctional complex comprising a display molecule linked to an 

identifier oligonucleotide identifying the chemical entities having participated in the 
synthesis of the display molecule, 

wherein said display molecule is generated by reacting at least two of said plurality 
20 of chemical entities associated with the identifier polynucleotide, 

wherein said at least two chemical entities are provided by separate building blocks 
or provided by the template and at least one building block. 

25 Further embodiments of this method that are suitable for use in the present 
invention are described in PCT application with publication number WO 
2004/083427 ("Ligational encoding of Small Molecules"), the contents of which are 
incorporated herein by reference. 

30 

Quasirandom structure and Function Guided Synthesis Methods 

In another embodiment of the present invention, said display molecule may be 
obtained by a method comprising the steps of 

35 
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i) providing a plurality of building blocks each comprising an oligonucleotide 
associated with one or more chemical entities, 

ii) providing at least one connector oligonucleotide capable of hybridising with one or 
5 more building block oligonucleotides, 

iii) immobilising at least one building block to a solid support, 

iv) hybridising said immobilised building block oligonucleotide to a first connector 
10 oligonucleotide, 

v) hybridising at least one additional building block oligonucleotide to said first 
connector oligonucleotide, 

15 vi) ligating building block oligonucleotides hybridised to the connector 
oligonucleotide, 

vii) separating the connector polynucleotide from the ligated building block 
oligonucleotides, 

20 

viii) reacting one or more chemical entities associated with different building block 
oligonucleotides, thereby obtaining a first bifunctional complex comprising a first 
display molecule or first display molecule precursor linked to a first identifier 
oligonucleotide identifying the chemical entities having participated in the synthesis 

25 of the first display molecule or first display molecule precursor, wherein said first 
bifunctional complex is immobilised to a solid support. 

Said chemical entities are optionally reacted in a reaction compartment from which 
the connector oligonucleotide has been removed in a washing and/or separation 
30 step prior to the reaction of said chemical entities. The method for obtaining one or 
more display molecules may further comprise the steps of: 



i) providing a second connector polynucleotide, 
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ii) hybridising said second connector polynucleotide to the identifier polynucleotide 
of said first bifunctional complex, 

iii) hybridising at least one further oligonucleotide of a building block to said second 
5 connector oligonucleotide, 

iv) ligating building block oligonucleotides hybridised to the second connector 
oligonucleotide, wherein at least one of said building block oligonucleotides are 
hybridised to the first identifier polynucleotide, 

10 

v) separating the second connector polynucleotide from the ligated building block 
oligonucleotides, preferably by diverting the second connector polynucleotide to 
another compartment, 

1 5 vi) reacting the first display molecule precursor with the one or more chemical 
entities associated with the ligated building block oligonucleotide(s), thereby 
obtaining a second bifunctional complex comprising a second display molecule or 
second display molecule precursor linked to a second identifier polynucleotide 
identifying the chemical entities having participated in the synthesis of the display 

20 molecule or display molecule precursor, 

wherein said second bifunctional complex is immobilised to a solid support. 

Furthermore, said steps i) to vi) may be repeated for different connector 
25 oligonucleotides and different further building blocks. 

In another embodiment of the present invention, said display molecule may be 
obtained by a method comprising the steps of 

30 i) providing a plurality of building blocks selected from the group consisting of 

a) building blocks comprising an identifier oligonucleotide linked to one or more 
chemical entities, 
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b) building blocks comprising an identifier oligonucleotide linked to one or more 
reactive groups, and 

c) building blocks comprising an identifier oligonucleotide comprising a spacer 
5 region, wherein said building blocks comprising a spacer region are preferably 

connector polynucleotides to which building blocks of groups a) and b) can 
hybridise, 

ii) generating a hybridisation complex comprising at least n building blocks by 
1 0 hybridising the identifier oligonucleotide of one building block to the identifier 

oligonucleotide of at least one other building block, 

wherein n is an integer of 4 or more 

1 5 wherein at least 3 of said at least n building blocks comprise a chemical entity, 

wherein no single identifier oligonucleotide is hybridised to all of the remaining 
identifier oligonucleotides, 

20 wherein optionally at least one of said building blocks of group c) is immobilised to a 
solid support, thereby providing a handle to which an oligonucleotide of at least one 
building block of groups a) or b) can hybridise, 

iii) covalently linking identifier oligonucleotides of building blocks comprising one or 
25 more chemical entities, thereby obtaining an identifier oligonucleotide comprising 

covalently linked identifier oligonucleotides each associated with one or more 
chemical entities, 

iv) optionally separating said identifier oligonucleotide obtained in step iii) from any 
30 immobilised connector oligonucleotides hybridised thereto, wherein said separation 

optionally comprises the step of diverting said identifier polynucleotide comprising 
covalently linked identifier oligonucleotides each associated with one or more 
chemical entities to a different reaction compartment, thereby separating said 
identifier polynucleotide from said immobilised connector oligonucleotides, 
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y) reacting said at least 3 chemical entities linked to the identifier polynucleotide, 
and 

vi) obtaining a bifunctional complex comprising a display molecule resulting from the 
5 reaction of a plurality of chemical entities, wherein said display molecule is linked to 
an identifier polynucleotide identifying the chemical entities having participated in the 
synthesis of the molecule. 

The method steps may be repeated for different, or the same, building blocks. 

10 

Further embodiments of the above methods that are suitable for use in the present 
invention are described in PCT application with publication number WO 
2004/056994 ("Quasirandom structure and function guided synthesis methods"), the 
contents of which are incorporated herein by reference. 

Enzymatic Encoding 

In another embodiment of the present invention, said display molecule may be 
obtained by a method comprising the steps of 

20 

i) providing an oligonucleotide comprising a priming site linked to a) a reaction site 
comprising one or more reactive group(s) and b) a site for tag addition, 

ii) providing a plurality of building blocks each comprising a probe oligonucleotide 
25 capable of hybridising with the priming site, wherein at least part of said plurality of 

building blocks comprise a probe oligonucleotide linked to different chemical entities, 

iii) sequentially hybridising different building blocks to the priming site of the 
oligonucleotide provided in step i), 

30 

iv) reacting for each hybridised building block a chemical entity with a reactive group 
of the reaction site of the oligonucleotide provided in step i), and 
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v) adding for every chemical entity reacted a unique tag residue to the site for tag 
addition, said unique tag identifying the chemical entity having reacted with a 
reactive group of the reactive site. 

5 In another embodiment of the present invention, said display molecule may be 
obtained by a method comprising the steps of: 

i) providing an oligonucleotide comprising a tag addition site and either 

one or more reactive site forfunctionalization, or 
a scaffold comprising one or more reactive site(s) for 
10 functionalization, 

ii) reacting one or more scaffold reactive sites with two or more chemical entities, 
and 

15 iii) adding for each chemical entity reacted with a scaffold reactive site a unique tag 
residue to the tag addition site, thereby obtaining a Afunctional complex comprising 
a display molecule resulting from the chemical entity reactions, said display 
molecule being linked to an identifier oligonucleotide comprising the individual 
unique tag residues. 

20 

Further embodiments of the above methods that are suitable for use in the present 
invention are described in PCT application with publication number WO 
2004/039825 ("Enzymatic Encoding"), the contents of which are incorporated herein 
by reference. 

25 

Display molecules 

Any suitable display molecule may be used in the methods of the present invention. 
For example, the plurality of display molecules may be selected from the group 

30 consisting of peptides wherein the amino acid residues are in the L-form or in the D- 
form, vinylogous polypeptides, glycopolypeptides, polyamides, vinylogous 
sulfonamide peptides, polysulfonamides, conjugated peptides comprising e.g. 
prosthetic groups, polyesters, polysaccharides, polycarbamates, polycarbonates, 
polyureas, polypeptidylphosphonates, polyurethanes, azatides, oligo N-substituted 

35 glycines, polyethers, ethoxyformacetal oligomers, polythioethers, polyethylene 
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glycols (PEG), polyethylenes, polydisulfides, polyarylene sulfides, polynucleotides, 
PNAs, LNAs, morphoiinos, oligo pyrrolinones, polyoximes, polyimines, 
polyethyleneimines, polyimides, polyacetals, polyacetates, polystyrenes, polyvinyl, 
lipids, phospholipids, glycolipids, polycyclic compounds comprising e.g. aliphatic or 
5 aromatic cycles, including polyheterocyclic compounds, proteoglycans, and 
polysiloxanes, including any combination thereof, 

wherein each display molecule is synthesised by reacting a plurality of chemical 
entities preferably in the range of from 2 to 200, for example from 2 to 100, such as 
1 0 from 2 to 80, for example from 2 to 60, such as from 2 to 40, for example from 2 to 
30, such as from 2 to 20, for example from 2 to 15, such as from 2 to 10, such as 
from 2 to 8, for example from 2 to 6, such as from 2 to 4, for example 2, such as 
from 3 to 100, for example from 3 to 80, such as from 3 to 60, such as from 3 to 40, 
for example from 3 to 30, such as from 3 to 20, such as from 3 to 1 5, for example 
15 from 3 to 1 5, such as from 3 to 1 0, such as from 3 to 8, for example from 3 to 6, 

such as from 3 to 4, for example 3, such as from 4 to 100, for example from 4 to 80, 
such as from 4 to 60, such as from 4 to 40, for example from 4 to 30, such as from 4 
to 20, such as from 4 to 15, for example from 4 to 10, such as from 4 to 8, such as 
from 4 to 6, for example 4, for example from 5 to 1 00, such as from 5 to 80, for 
20 example from 5 to 60, such as from 5 to 40, for example from 5 to 30, such as from 
5 to 20, for example from 5 to 1 5, such as from 5 to 1 0, such as from 5 to 8, for 
example from 5 to 6, for example 5, such as from 6 to 100, for example from 6 to 80, 
such as from 6 to 60, such as from 6 to 40, for example from 6 to 30, such as from 6 
to 20, such as from 6 to 15, for example from 6 to 10, such as from 6 to 8, such as 
25 6, for example from 7 to 100, such as from 7 to 80, for example from 7 to 60, such 
as from 7 to 40, for example from 7 to 30, such as from 7 to 20, for example from 7 
to 15, such as from 7 to 10, such as from 7 to 8, for example 7, for example from 8 
to 100, such as from 8 to 80, for example from 8 to 60, such as from 8 to 40, for 
example from 8 to 30, such as from 8 to 20, for example from 8 to 1 5, such as from 
30 8 to 10, such as 8, for example 9, for example from 10 to 100, such as from 10 to 
80, for example from 1 0 to 60, such as from 10 to 40, for example from 10 to 30, 
such as from 10 to 20, for example from 10 to 15, such as from 10 to 12, such as 
10, for example from 12 to 100, such as from 12 to 80, for example from 12 to 60, 
such as from 12 to 40, for example from 12 to 30, such as from 12 to 20, for 
35 example from 12 to 15, such as from 14 to 100, such as from 14 to 80, for example 
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from 14 to 60, such as from 14 to 40, for example from 14 to 30, such as from 14 to 
20, for example from 14 to 16, such as from 16 to 100, such as from 16 to 80, for 
example from 16 to 60, such as from 16 to 40, for example from 16 to 30, such as 
from 16 to 20, such as from 18to100, such as from 18 to 80, for example from 18to 
5 60, such as from 1 8 to 40, for example from 1 8 to 30, such as from 1 8 to 20, for 

example from 20 to 100, such as from 20 to 80, for example from 20 to 60, such as 
from 20 to 40, for example from 20 to 30, such as from 20 to 25, for example from 
22 to 100, such as from 22 to 80, for example from 22 to 60, such as from 22 to 40, 
for example from 22 to 30, such as from 22 to 25, for example from 25 to 1 00, such 

1 0 as from 25 to 80, for example from 25 to 60, such as from 25 to 40, for example 

from 25 to 30, such as from 30 to 100, for example from 30 to 80, such as from 30 to 
60, for example from 30 to 40, such as from 30 to 35, for example from 35 to 100, 
such as from 35 to 80, for example from 35 to 60, such as from 35 to 40, for 
example from 40 to 100, such as from 40 to 80, for example from 40 to 60, such as 

1 5 from 40 to 50, for example from 40 to 45, such as from 45 to 1 00, for example from 
45 to 80, such as from 45 to 60, for example from 45 to 50, such as from 50 to 100, 
for example from 50 to 80, such as from 50 to 60, for example from 50 to 55, such 
as from 60 to 100, for example from 60 to 80, such as from 60 to 70, for example 
from 70 to 100, such as from 70 to 90, for example from 70 to 80, such as from 80 to 

20 100, for example from 80 to 90, such as from 90 to 1 00. 

The display molecules may be polypeptides that are not alpha-polypeptides for 
example. In another embodiment of the present invention, the display molecules are 
natural or non-natural nucleic acids. In one embodiment, it is preferred that the 
25 display molecules have a molecular weight of less than 2000 Dalton, such as less 
than 1000 Dalton, for example less than 500 Dalton. 

One or more chemical entities may be transferred to a display molecule or an 
intermediate thereof by a chemical building block further comprising an anti-codon, 
30 such as whereby the information of the anti-codon is transferred in conjunction with 
the chemical entity to the Afunctional complex. 



Any of the methods described herein may be repeated one or more time to 
increase the level of partitioning and/or enrichment, such as by repeating 
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said method once, twice, thrice, four times, five times, six times, seven times 
or more than seven times. 

DETAILED DISCLOSURE OF THE FIGURES 

5 Figure 1 discloses the common selection process in schematic form. In 
diagram A an ideal library is shown that comprises- a variety of different 
members of a library (Diversity) in the same concentration, i.e. with the same 
number of each bifunctional complex in the mixture. 

10 The library represented in diagram A is subjected to a selection process. 

Generally, the selection process implies that the library is contacted with a 
target to allow for an. interaction, usually a binding interaction, to take place. 
Ideally, the non-binding members of the library are discharged and the 
bifunctional complexes able to perform an interaction with the target are 

1 5 selected. However, in reality a certain amount of non-binding complexes are 
eluted together with the binding complexes. The non-binding complexes are 
referred to a background in diagram B. The background usually increases 
with the diversity, i.e. a large library generates a higher background relative 
to small library. 

20 

To increase the probability of finding a ligand, it is generally desired to apply 
a library as large as possible. The high background however generates a 
high level of noise so that a detection of the hits is difficult or even 
impossible. The present invention suggests a method to reduce the 

25 background so as to be able to identify the hit. In diagram C the background 
is broken down to individual molecules to illustrate that the amount of the 
binding ligand is higher than each of the molecules in the selected library. 
The imbalance formed due to the selection process between the hit and the 
remainder of the library members is then used in the subsequent steps of the 

30 present method. 
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Fig. 2 schematically discloses a homo-duplex and a hetero-duplex. A homo- 
duplex is an identifier oligonucleotide hybridised to a fully complementing 
'. oligonucleotide. A hetero-duplex is an identifier oligonucleotide hybridised to 
an oligonucleotide showing one, two, or more mis-matching nucleotides. A 
5 mis-matching nucleotide is a nucleotide not paired in accordance with the 
Watson-Crick base-pairing rules, in which A pairs with T (or U) and C pairs 
with G. In Fig. 2, the mis-matching nucleotide(s) are illustrated by a filled 
circle. 

10 The homo- and hetero-duplexes may be formed by denaturing a PCR 

product of the result of the selection shown on Fig. 1 and subsequently allow 
the mixture to hybridise again. The denaturing is usually performed by 
heating to a temperature above the melting point of the PCR product. The 
hybridisation conditions are usually obtained by lowering the temperature 

1 5 well below the melting point of the homo-duplexes. 

Fig. 3 illustrates various steps of the present invention. In a first step the 
library is subjected to a selection process as described in Fig. 1 , in which the 
straight line depicts the identifier oligonucleotide from the complex of the 

20 binding display molecule. The output of the selection process is in a 

subsequent step subjected to a melting and reannealing step. Due to the 
excess of identifier oligonucleotides from the binding display molecule, the 
mixture of homo- and hetero-duplexes comprises a relatively higher content 
of homo-duplexes from the identifiers of the binding display molecules than 

25 homo-duplexes from other sources. In the final step shown on Fig. 3, the 

homo-duplexes are separated from the mixture, e.g. by cleavage of hetero- 
duplexes with an enzyme. The result is an enrichment of identifier 
oligonucleotides from binding display molecules. 

30 Fig. 4 discloses a diagram of the overall principle of mis-match selection. 

Initially, a library of bifunctional molecules is subjected to a selection process. 
According to the broadest scope of the invention, any type of bifunctional 
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complex library can be used, including phage display, ribosome display, and 
small molecule display. Following the selection, the identifier oligonucleotide 
is amplified using PCR or a similar method in order to generate more copies 
of the individual identifier oligonucleotides. Preferably, the amplification 
5 retains the proportion between the individual identifier oligonucleotides. 

A number of times the following cycle can be repeated: The PCR ampiicons 
are heated to a temperature above the melting point of the homo-duplexes 
and subsequent cooled to form a mixture of hetero-duplexes and homo- 

10 duplexes. The homo-duplexes are recovered from the mixture by i) binding 
the mis-matched duplexes to a protein, such as MutS, ii) cleaving the hetero- 
duplex using an appropriate enzyme, such as Cel I, or iii) physical means, 
such as DHPLC. If the result of the mis-match selection still comprises too 
much noise, i.e. it is not possible when sequencing a small amount of 

1 5 sequences to deduce one or more sequences, which occurs more frequently 
■ than others, the cycle may be repeated, starting with a PCR amplification of 
the output of the mis-match selection. 

The final step of the method includes an analysis of the output. A variety of 
20 techniques are available to the skilled person in the art, including sequencing 
using capillary electrophoresis, bead array, high-density microarray etc. If the 
mis-match selection has been successful, an analysis of a relatively few 
oligonucleotides will reveal which display molecules of the library that have 
the highest binding affinity, because the identifier oligonucleotides of 
25 complexes having display molecules with high binding affinity occurs more 
frequent. 

As item 1 there is provided a method for obtaining display molecule(s) having 
affinity towards a target, comprising the steps of 
30 a. providing a library comprising a plurality of different display molecules, 

each display molecule being associated with an identifier 
oligonucleotide, which codes for the identity of said display molecule, 
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b. contacting the library with a target to allow for an interaction between 
the display molecules of the library with the target, 

c. partitioning a fraction enriched in identifier oligonucleotides of display 
molecules interacting with the target, 

5 d. subjecting the fraction to conditions at which hetero-duplexes and 

homo-duplexes are formed, 

e. recovering the homo-duplexes, and 

f. deducing from the homo-duplexes the identity of the display 
molecule(s) interacting with the target. 

10 

Item 2 concerns the method of item 1, wherein in step c the identifier 
oligonucleotides of the library members are provided in homo-duplex form. 
Item 3 concerns the method of items 1 and 2, wherein in step d, the 
conditions effect separation of the strands of the duplex identifier 

15 oligonucleotides, and subsequently hybridisation. 

Item 4 concerns the method of any of items .1 to 3, wherein the homo- 
duplexes in step e are recovered by removal of hetero-duplexes. 
Item 5 concerns the method of item 4, wherein the hetero-duplexes are 
removed by enzymatically degradation. 

20 Item 6 concerns the method of item 5, wherein the enzyme is a nuclease, 
Item 7 concerns the method of any of items 4 to 6, wherein the enzyme is 
selected from T4 endonuclease VII, T4 endonuclease I, CEL I, nuclease S1, 
or variants thereof. 

Item 8 concerns the method of any of the items 5 to 7, wherein the enzyme is 

25 thermostable. 

Item 9 concerns the method of item 1, wherein the display molecule is a 
reaction product of two or more chemical entities and the identifier 
oligonucleotide comprises codons identifying the chemical entities. 
Item 10 concerns the method of item 9, wherein the chemical entities are 

30 precursors for a structural unit appearing in the display molecule. 
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Item 1 1 concerns the method of any of the items 9 to 1 0, wherein some or all 
of the chemical entities are not naturally occurring a-amino acids or 
precursors thereof. 

Item 12 concerns the method of item 9, wherein each codon comprises 4 or 
5 more nucleotides. 

Item 1 3 concerns the method of any of the items 1 to 1 2, wherein the display 
molecules of the library are non-a-polypeptides. 

Item 14 concerns the method of any of the items 1 to 13, wherein the display 
molecules of the library are non-nucleic acids. 
10 Item 15 concerns the method of any of the items 1 to 14, wherein the display 
molecule has a molecular weight less than 2000 Dalton, preferably less than 
1000, and most preferred less than 500 Dalton. 

Item 16 concerns the method of any of the preceding items, wherein the 
identifier oligonucleotide uniquely identifies the display molecule. 

1 5 Item 17 concerns the method of any of the items 1 to 1 6, wherein one or 

more chemical entities are transferred to the nascent display molecule by a 
chemical building block further comprising an anti-codon. 
Item 18 concerns the method of item 17, wherein the information of the anti- 
codon is transferred in conjunction with the chemical entity to the nascent 

20 complex. 

Item 19 concerns the method of any of the preceding items, wherein the 
chemical entities are reacted without enzymatic interaction. 
Item 20 concerns the method of any of the items 1 to 1 9, wherein the codons 
are separated by a framing sequence. 
25 Item 21 concerns the method of any of the items 1 to 20, wherein a 

selectively cleavable linker joins the display molecule and the identifier 
oligonucleotide. 

Item 22 concerns the method of item 21, wherein the linker is cleaved by 
irradiation. 

30 Item 23 concerns the method of any of the items, wherein the library 
comprises one, two or more different library members. 
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Item 24 concerns the method of any of the items 1 to 23, wherein the library 
comprises 1 ,000 or more different members. 

Item 25 concerns the method of item 1, wherein the molecular target is of a 
biological origin. 

5 Item 26 concerns the method of any of the items 1 to 25, wherein the 
molecular target is immobilized on a solid support. 

Item 27 concerns the method of item 26, wherein the target immobilized on 
the support forms a stable or quasi-stable dispersion. 

Item 28 concerns the method of items 26 or 27, wherein a cleavable linker is 
10 present between the solid support and the molecular target. 

Item 29 concerns the method of any of the items 1 to 28, wherein the 
molecular target is a protein. 

Item 30 concerns the method of item 29, wherein the protein is selected from 
the group consisting of kinases, proteases, phosphatases, and anti-bodies. 
15 Item 31 concerns the method of any of the items 1 to 29, wherein the 
molecular target and/or the display molecule is a nucleic acid. 
Item 32 concerns the method of item 31, wherein the nucleic acid is a DNA or 
RNA aptamer. 

Item 33 concerns the method of any of the items 29 to 32, wherein the target 
20 protein is attached to the nucleic acid responsible for the formation thereof. 
Item 34 concerns the method of any of the items 1 to 33, wherein the 
contacting step includes that a target is mixed with a library of complexes. 
Item 35 concerns the method of item 34, wherein a target is saturated with a 
known ligand prior to the mixing step. 
25 Item 36 concerns the method of item 1, wherein the recovered homo- 
duplexes of step e are amplified prior to decoding the identity of the display 
molecule. 

Item 37 concerns the method of any of the items 1 to 36, wherein the 
partitioned fraction of identifier oligonucleotides of step c is amplified by PCR 
30 prior to step d. 

Item 38 concerns the method of item 37, wherein the identifier 
oligonucleotides are proportionally amplified. 
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Item 39 concerns the method of any of the items 1 to 38, wherein the 
recovered homo-duplexes of step e one or more times are recycled to step d. 
Item 40 concerns the method of item 39, wherein the recovered homo- 
duplexes are amplified prior to the treatment according to step d. 
5 Item 41 concerns the method of items 38 or 39, wherein a decoding occurs 
before recycling to step d. 

Item 42 concerns the method of any of the items 39 to 41 , wherein the 
information obtained from the decoding is used to modify the composition of 
the identifier oligonucleotides or complements thereof before recycling to 
10 step d. 

Item 43 concerns the method of item 41, wherein the modification includes 
extinction of certain identifiers. 

1 5 Examples 

EXAMPLE 1. Statistical calculations of mismatch selection (MISE) treatment 
in a library after initial selection. 

20 Theoretical procedure: 

This example describes the statistical calculation of MISE treatment 
simulating various libraries with certain size and diversity and different 
enrichment factors in the initial selection process. Below is definition of the 
parameters used in the calculations. 

25 

Steps (x): 

1 : initial library 

2 : library after selection 

3 : library after first MISE round 

30 4 : library after second MISE round 

5 : library after third MISE round 

6 : library after fourth MISE round 
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N(t,x) : Total number of molecules in the library in step x 

N(p,x) : Total number of molecules in pool p in step x 

A(t,x) : Total number of molecules formed in PCR amplification in step x 

5 D(t,x) : Diversity (number of different molecules in library) in step x 

D(p,x) : Diversity of pool p (number of different molecules in library) in 
step x 

R(p) : Relative enrichment factor of pool p 

SO(x) : Fraction of homoduplexes surviving MISE treatment 

10 SE(x) : Fraction of heteroduplexes surviving MISE treatment 

STEP 1: Selection. 

In this step the initial library is contacted with target. Unbound templates are 
washed off and bound templates are recovered. The templates will be 
15 recovered according to the binding efficiency of their displayed ligand to the 
target. 

N(p,1) = N(p,s)*R(p,s) 

20 STEP 2: Calculation of diversity after selection. 

The selection step can reduce diversity of the library if for some of the 

background binders less than 1 molecule survives selection. 

If N(p,1) is less than D(p,s) then N(p,1) is set equal to D(p,s) [loss of 

diversity] 

25 

STEP 3: PCR Amplification of selection output. 

By addition of a known amount of primers in the PCR amplification, the total 
amount of molecules in the library after amplification can be normalized to a 
specific amount [A(t, 1 )] 
30 After amplification N(t, 1 )[ p0 st-ampifficationj equals A(t, 1 ) 

And N(p,1) [ p OS t- amp iif, C ation3 equals N(pJ)[ pr e-amplfficationj * A(t,1) / N(t,1 )[pre-amplification] 



WO 2005/078122 PCT/DK2005/000106 

80 

STEP 4: Homo- and hetero-duplex separation. 

The resulting duplexes are denatured by heating and the individual strands 
are reannealed in a random fashion. Thus the re-formation of duplexes takes 
place stochastically. 
5 The frequency of a species in the total library is calculated by 
F(p,1) = N(p,1)/N(t,1) 

Since all species in a pool behave in the same fashion in this simulation, the 
frequency is equal for all species in a pool. 
10 The number of homoduplexes formed in pool [p] is 
0(p,1) = [N(t,1) * (F(P,1)) A 2 * D(p,1) ] / 2 

The number of heteroduplexes involving strands from pool [p] is 
E(p,1) = 2 * F(p,1) * (1-F(p,1)) * D(p,1) * N(t,1) 

15 

STEP 5: Homo- and hetero-duplex separation. 

All duplexes (homo- and heteroduplexes) are reacted with a mismatch- 
specific enzyme that specifically degrades heteroduplexes. It is expected that 
a small fraction of homoduplexes is also degraded by the mismatch-specific 
20 enzyme. 

The number of surviving homoduplexes in pool p is 
0(p,1) = SO(1) 

If 0{p,1) is then less than zero 0(p,1) is set to zero [less than one molecule 
25 exists] 

The number of surviving heteroduplexes in pool p is 
E(p,1) = SE(1) 

If 0(p,1) is then less than zero 0(p,1) is set to zero 

30 

STEP 6: Amplification 

The total library is normalized to N(t,2) corresponding to a PCR step. 
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Fraction in pool p of number of total molecules: 
Calculated example A 

A model library composed of 5 pools (b-f) that contain species [pool b 
5 contains 1 6 species, pool c contains 8 species etc.] that bind target 

specifically and can be enriched from 0,1 fold (pool f) to 6,25e-3 fold (pool b). 
Pool a contains 1e8 species (minus the sum of the other pools) that bind 
unspecifically and therefore are depleted relatively from the library during 
selection: R(a) = 1e-7. In this case the selection step reduces the number of 
10 molecules in pool a from approximately 1e1 1 to 1e1 1*R(a) = 1e4 and the 
number of molecules in pool f from 1000 to 100. 

N(t,x) = 1e1 1 : Total number of molecules in the library in step x 
15 A(t,x) = 1e1 1 : Total number of molecules formed in PGR amplification in 
step x 

D(t,x) = 1e8 : Diversity (number of different molecules in library) in step x 
D(a,1) = 1e8, D(b,1) = 16, D(c,1) = 8, D(d,1) = 4, D(e,1) = 2, D (f,1) = 1 
R(a) = le-7, R(b) = 6,25e-3, R(c) = 1,25e-2, R(d) = 2,5e-2, R(e) = 5e-2, R (f) 
20 = 1e-1 v 

SO(x) = 0,8 : Fraction of homoduplexes surviving MISE treatment 
SE(x) = 1e-4 : Fraction of heteroduplexes surviving MISE treatment 

Table 1.1. Fraction of molecules in pool p in step x [N(p,x) divided by N(t,x)] 
25 after each amplification step. (Numbers in percent). 



Step Pool (p) 
(x) 







a 


b 


C 


d 


e 


f 


Ib-f 


1 




99,999969 


0,000016 


0,000008 


0,000004 


0,000002 


0,000001 


0,000031 


2 




95,238095 


0,952381 


0,952381 


0,952381 


0,952381 


0,952381 


4,761905 


3 




63,670137 


1,558650 


2,656209 


4,851337 


9,241583 


18,022084 


36,329863 
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0,529398 


0,050570 


0,251337 


1,591524 


11,378731 


86,198439 


99,470602 


5 




0,000177 


0,000019 


0,000189 


0,008976 


0,867189 


99,123449 


99,999823 


6 




0,000000 


0.000000 


0,000000 


0,000002 


0,004046 


99.995951 


100,000000 



As can be seen the library consisting of > 95 % background binders (pool a) 
after selection (step 2) consists of only 0,5 % background binders and 99 % 
binders (the sum of pools b-f [Iw]) after selection and 2 rounds of MISE 
5 (steps 3 and 4) using the described conditions. 

Calculated example B (lower enrichment factor) 

A model library composed of 5 pools (b-f) that contain species [poo! b 

contains 1 6 species, pool c contains 8 species etc.] that bind target 

10 specifically and can be enriched from 0,01 fold (pool f) to 6,25e-4 fold (pool 
b). Pool a contains 1e8 species (minus the sum of the other pools) that bind 
unspecifically and therefore are depleted relatively from the library during 
selection: R(a) = 1e-7. in this case the selection step reduces the number of 
molecules in pool a from approximately 1e1 1 to 1e1 1*R(a) = 1e4 and the 

1 5 number of molecules in pool f from 1000 to 10. 

N(t,x) = 1e1 1 : Total number of molecules in the library in step x 

A(t,x) = 1e1 1 : Total number of molecules formed in PCR amplification in 

step x 

20 D(t,x) = 1e8 : Diversity (number of different molecules in library) in step x 
D(a,1) = 1e8, D(b,1) = 16, D(c,1) = 8, D(d,1) = 4, D(e,1) = 2, D (f,1) = 1 
R(a) = 1e-7, R(b) = 6,25e-4, R(c) = 1,25e-3, R(d) = 2,5e-3, R(e) = 5e-3, R (f) 
= 1e-2 

SO(x) = 0,8 : Fraction of hombduplexes surviving MISE treatment 
25 SE(x) = 1e-4 : Fraction of heteroduplexes surviving MISE treatment 

Table 1.2. Fraction of molecules in pool p in step x [N(p,x) divided by N(t,x)] 
after each amplification step. (Numbers in percent). 
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Step (x) Pool (p) 







3 


b 


c 


d 


e 


f 




1 




yy,yyyyoy 


n OOQQ1 b 

U,UWUU 1 KJ 


o ooonoR 


0 000004 


0,000002 


0,000001 


0,000031 


2 




99,502488 


0,099502 


0,099502 


0,099502 


. 0,099502 


0,099502 


0,497512 


3 




99,088563 


0,099085 


0,106137 


0,141386 


0,211904 


0,352926 


0,911437 


4 




95,128061 


0,095119 


0,111690 


0,234614 


0,763023 


3,667493 


4,871939 


5 




19,132181 


0,019130 


0,025357 


0,114356 


1,807232 


78,901744 


80,867819 


6 




0,008267 


0,000008 


0,000011 


0,000098 


0,026939 


99,964675 


99,991733 



As can be seen the library consisting of > 99,5 % background binders (pool 
a) after selection (step 2) consists of only 19 % background binders and > 80 
% binders (the sum of pools b-f [I b -f]) after selection and 3 rounds of MISE 
5 (steps 3,4 and 5) using the described conditions. 

Calculated example C (Effect of more efficient mismatch cleavage) 
A model library composed of 5 pools (b-f) that contain species [pool b 
contains 16 species, pool c contains 8 species etc.] that bind target 

10 specifically and can be enriched from 0,1 fold (pool f) to 6,25e-3 fold (pool b). 
Pool a contains 1 e8 species (minus the sum of the other pools) that bind 
unspecifically and therefore are depleted relatively from the library during 
selection: R(a) = 1e-7. In this case the selection step reduces the number of 
molecules in pool a from approximately 1e11 to 1e11*R(a) = 1e4 and the 

1 5 number of molecules in pool f from 1 000 to 1 00. 

N(t,x) = 1e1 1 : Total number of molecules in the library in step x 

A(t,x) = 1e1 1 : Total number of molecules formed in PCR amplification in 

step x 

20 D(t,x) = 1e8 : Diversity (number of different molecules in library) in step x 
D(a,1) = 1e8, D(b,1) = 16, D(c,1) = 8, D(d,1) = 4, D(e,1) = 2, D (f,1) = 1 
R(a) = 1e-7, R(b) = 6,25e-3, R(c) = 1 ,25e-2, R(d) = 2,5e-2, R(e) = 5e-2, R (f) 
= le-1 

SO(x) = 0,8 : Fraction of homoduplexes surviving MISE treatment 
25 SE(x) = 1e-5 : Fraction of heteroduplexes surviving MISE treatment 
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Table 1.3. Fraction of molecules in pool p in step x [N(p,x) divided by N(t,x)] 
after each amplification step. (Numbers in percent). 



Step (x) Pool (p) 







a 


b 


c 


d 


e 


f 




1 




99,999969 


0,000016 


0,000008 


0,000004 


0,000002 


0,000001 


0,000031 


2 




95,238095 


0,952381 


0,952381 


0,952381 


0,952381 


0,952381 


4,761905 


3 




39,292097 


2,026837 


3,971975 


7,862268 


15,642835 


31,203988 


60,707903 


4 




0,022674 


0,023500 


0,177898 


1,388845 


10,985145 


87,401938 


99,977326 


5 




0,000001 


0,000001 


0,000057 


0,006308 


0,783933 


99,209701 


99,999999 


6 




0,000000 


0,000000 


0,000000 


0,000000 


0,003142 


99,996858 


100,000000 



5 

As can be seen the library consisting of > 95 % background binders (pool a) 
after selection (step 2) consists of only < 40 % background binders and > 60 
% binders (the sum of pools b-f [I b -f]) after selection and 1 round of MISE 
(step 3) using the described conditions. 

10 

These theoretical examples show that one can extract specific 
oligonucleotides from a diverse pool based on formation of homo- and 
hetero-duplexes. These examples also illustrate that parameters such as 
diversity, enrichment factor, library size and degree of separation of homo- 
1 5 and hetero-duplexes will influence the outcome of the mismatch selection 
treatment. 



EXAMPLE 2. General experimental procedure and material for single codon 



oligonucleotides 



WO 2005/078122 PCT/DK2005/000106 

85 

Double stranded DNA species are formed by extension of oligonucleotide 
primer F on single stranded templates N 0 and N 12 using Sequenase 2.0 
(Amersham Biosciences) and buffer. 

5 Heat-denaturation and annealing of samples: 

A sample of either double stranded N 0 or double stranded N 12 or a mixture of 
these in 5 pi of water containing 40 mM HEPES pH 7.5, 50 mM NaCl, 16 mM 
MgCI 2 was heated to 95°C for 10 minutes and allowed to anneal by lowering 
the temperature. 

10 

If only double stranded N 0 is present in the sample, this treatment is expected 
to result in the re-formation of homoduplexes. 

If only double stranded N 12 (4 A 12 = 1,7*10 7 different species) is present in the 
1 5 sample, this treatment is expected to result in the re-formation of 
homoduplexes and the formation of heteroduplexes. 

If both double stranded N 0 and double stranded N 12 is present in the sample, 
this treatment is expected to result in the re-formation of N 0 and N-| 2 
20 homoduplexes and the formation of N n2 -N 12 and N 0 -N-| 2 heteroduplexes 
(containing 1 or more mismatched base pairs). 

Treatment of samples with mismatch-specific enzyme: 

To a sample consisting of either double stranded N 0 or double stranded N 12 

25 or a mixture of these was added 1 pi (10x) Surveyor reaction buffer, 0.5 pi 
Enhancer, 1 pi Surveyor Nuclease, and 2.5 pi H20. The samples were then 
mixed and incubated 1 hour at 42°C. Then the samples were heated to 95°C 
for 10 minutes and allowed to anneal by lowering the temperature. To 3 pi of 
each sample was added 1 pi 5xEXT buffer [100 mM HEPES pH 7.5, 750 mM 

30 NaCl, 40 mM MgCl 2 ], 4 pi H 2 0, and 1 pi E. coli Exonuclease VII (10 U/pl) 
(USB). The samples were then incubated at 37°C for 16 hours. 
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QPCR (quantitative real-time polymerase chain reaction) analyses to 
estimate number of molecules of each species before and after treatment. 
The QPCR was performed using a Taqman probe which is a fluorescence 
resonance energy transfer (FRET) probe consisting of a short oligonucleotide 
5 complementary to one of the amplified strands. The probe contains a 
fluorofor and a quencher molecule at the 5' and 3' end of the probe, 
respectively. This probe is included in the real-time PCR reaction along with 
the required forward and reverse PCR primers. The quencher molecule 
quenches the fluorescence of the fluorofor due to its close proximity on the 
1 0 probe. As the Taq polymerase replicates the new strand of the DNA, its 5-3' 
exonuclease activity degrades the FRET probe from the 5'-end. This 
degradation releases the reporter flurofor from its proximity to the quencher, 
resulting in fluorescence of the reporter. Accumulation of fluorescence as a 
result of target amplification was detected in real time in an ABI 7900 HT 
1 5 sequence detection system (Applied Biosystems) which contains an optical 
detection systems During the exponential amplification phase, the amount of 
target should be doubling every cycle. Quantification analyses use the C T 
value, which is the point (cycle number) at which the fluorescence signal 
reaches a specific threshold level of detection in the exponential phase. The 
20 more abundant the template, the earlier this point is reached. The quantity of 
DNA in the sample can be obtained by interpolation of its C T value vs. a 
linear standard curve of C T values obtained from a serially diluted standard 
solution. 

25 Q-PCR reactions 

For 5 ml premix (for one 96-well plate): 

2.5 ml Taqman Universal PCR Master Mix (Applied Biosystems) 
450 pi RPv2 (10 pmol/pl)_ 
25 pi Taqman probe (50 pM) 
30 1075 plH 2 O 
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40.5 ul premix 4 was aliquoted into each well and 4.5 pi of relevant upstream 
PCR primer (Primer F for standard curve and N12) or the N 0 specific primers 
QP1-3 and 5 ul sample (H 2 0 in wells for negative controls) was added. 

The samples for the standard curve was prepared by diluting Temp4 to 10 
copies/5 pi and subsequently performing a 1 0-fold serial dilution of this 
sample. 5 pi was used for each Q-PCR reaction. 

Thermocyciing/measurement of fluoresence was performed on an Applied 

Biosystems ABI Prism 7900HT real-time instrument utilizing the cycling 

parameters: 

95°C10min 

40 cycles of 

95°C 15 sec 

64°C 1 min 

Oligonucleotides 
N12 oiigo: 

5'-GTCAGAGACGTGGTGGAGGAAGTCTTCCTAGAAGCTGGA 
NNNNNNNNNNN NTCT AG C AG CTAGT ATG AC GTGGTGTCC AAGCTG-3 ' 

No oiigo: 

5 ' -GCTAG AG ACGTG GTGG AG G AAGTCTTCCT AG AAGCTG G A 
TATCTTCAGTTCTCGACTCCTGAGTATGACGTGGTGTCCAAGCTG-3' 

Primer F: 

5'- CAGCTTGGACACCACGTCATAC -3' 



Primer R: 

5'- GTCAGAGACGTGGTGGAGGAA-3' 
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QP1-3: 

5'-TCATACTCAGGAGTCGAGAACTGAAGATA-3' 

Temp4: 
5 5- 

GCTAGAGACGTGGTGGAGGAAGTCTTCCTAGAAGCTGGATATCTGACG 
TGTTGAC 

GTACACAGTATGACGTGGTGTCCAAGCTG-3' 

10 TaqMan probe: 

5-6FAM-TCCAGCTTCTAGGAAGAC-MGB-NFQ (Applied Biosystems) 

6FAM : 6-Carboxyfluorescein 
MGB : Minor groove binder 
NFQ : Non-fluorescent quencher 

15 

EXAMPLE 3: Discrimination between homo- and heter oduplexes in an 
identifier oligonucleotide containing a codon. 

This example shows the possibility to specifically remove/degrade/separate 
20 heteroduplexes from homoduplexes. This is illustrates using a mismatch 

cleavage enzyme but other techniques such as for example gel separation, 
mismatch binding and column separation can also be used. 

Two samples were subjected to the experimental procedure described in 

25 Example 2: 

Sample 2A: 1 pmol double stranded N 0 in 5 pi of water containing 40 mM 
HEPES pH 7.5, 50 mM NaCI, 16 mM MgCI 2 

Sample 2B: 1 pmol double stranded Ni 2 in 5 pi of water containing 40 mM 
HEPES pH 7.5, 50 mM NaCI, 16 mM MgCI 2 

30 

Results of QPCR analyses of sample 2A: 
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Number of homoduplexes before MISE treatment: 6, 71E+08 (6, 

00E+08 expected) 

Number of homoduplexes after MISE treatment: 4, 54E+08 
Thus (4, 54 / 6, 71) = 67 % of homoduplexes have survived the MISE 
5 treatment 

Results of QPCR analyses of sample 2B: 

Number of heteroduplexes before MISE treatment: 7, 35E+08 

(6 ,00E+08expected) 
10 Number of heteroduplexes after MISE treatment: 1, 16E+07 

Thus (1, 16 / 73, 5) = 1, 5 % of heteroduplexes have survived the MISE 
treatment 



15 



That is the relative survival factor of homoduplexes is (67 / 1 , 5) = 45 



The relative survival factor obtain in this example is dependent on the 
experimental conditions. This can be future optimized if required by tuning 
the mismatch treatment conditions. The relative survival factor could also be 
different using other techniques in specifically remove/degrade/separate 
20 heteroduplexes from homoduplexes, as for example gel separation, 
mismatch binding and column separation. 

EXAMPLE 4: Enrichment of sequences in excess over a bac kground 
sequence population. 
25 This example describes the possibility to enrich a specific oligonucleotide 
among a diverse background of sequences. 

Samples were subjected to the experimental procedure described in 
Example 2: 

A mixed sample containing 1 pmol sample N 12 (4 A 12 = 1,7*10 7 different 
30 species of double stranded DNA) and 0,001 pmol of sample N 0 in 5 pi of 
water containing 40 mM HEPES pH 7.5, 50 mM NaCI, 16 mM MgCI 2 was 
subjected to general procedure 2. Then samples were diluted 300 times and 
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analyzed by QPCR (input) and the result from one mismatch selection 
treatment (output) as described in Example 2 



QPCR analysis result: 



Sample 


#N 12 


#N 0 


Fold excess N 12 


Input 


5.11E+08 


1.27E+05 


4040 


Output 


3.80E+03 


6.58E+01 


58 



5 

The enrichment factor with the mismatch selection procedure is 70 (4040/58) 
in this experiment. 

This example demonstrates the possibility to enrich for a specific 
1 0 oligonucleotide sequence (N 0 ) among a diverse pool of oligonucleotide 

sequences (N 12 j. The specific sequence (N 0 ) is in 17000-fold excess over 
each specific N 12 sequences but in 1000-fold (4040 experimentally) lower 
concentration compared to the entire population of Ni 2 sequences. Thus, 
although the background sequences are in excess, the specific sequence is 
15 able to survive the mismatch selection treatment. 

EXAMPLE 5: Sequential mismatch selection treatment 
One important feature with mismatch selection is that one can perform 
multiple round of treatment. This will permit extreme enrichment factors 
20 which might be important when the library size is much larger than the 
enrichment factor obtain in the initial selection. 

A mixed sample containing 1 pmol sample N 12 (4 A 1 2 = 1 ,7*1 0 7 different 
species of double stranded DNA) and 0,0001 pmol of sample N 0 in 5 ui of 
25 water containing 40 mM HEPES pH 7.5, 50 mM NaCI, 16>mM MgCI 2 was 
subjected to condition described in Example 2. 

Following the procedure described in Example 2, the samples were run on a 
denaturing 10 % polyacrylamide gel and a gel slice bands containing full- 
30 length templates (N 0 or N 12 ) [estimated using 32P-labeled marker oligos of 
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the same length as N 0 and Ni 2 were excised from the gel and placed in an 
eppendorph tube. 100 pi of 1 x EXT buffer [40 mM HEPES pH 7.5, 50 mM 
NaCl, 16 mM MgCI 2 ] was added. DNA was liberated from the gel by freeze- 
thawing using 2 cycles of heating to 99°C and cooling to -20°C. 400 pi H 2 0 
5 was added and 5 pi DNA mixture was used for PCR amplification with 5 pmol 
each of primers F and R in a 25 ul reaction using Ready-To-Go beads 
(Amersham Biosciences). 

Approximately 1 pmol of amplified material (containing No and N-| 2 ) was 
1 0 subjected to another round of mismatch selection as described in Example 2, 
except that the Exo VII treatment step was excluded. Then the sample was 
diluted 300 times and analyzed by QPCR 



QPCR analysis result: 



Sample 


#N 12 


#N 0 


Fold excess N 12 


Input 


1.54E+09 


3.42E+04 


45098 


Output 1 st round of general 
procedure 2 


1.30E+04 


1.99E+02 


66 


Output 2 n0 round of general 
procedure 2 


3.53E+03 


7.28E+02 


5 



15 

The enrichment factor in the 1 st round of mismatch selection treatment 
described in Example 2 is 45098 / 66 = 683 

The enrichment factor in the 2 nd round of mismatch selection treatment 
described in Example 2 (excluding the Exo VII treatment) is 66 / 5 = 13 
20 The total enrichment factor from these two mismatch selection treatment is 
8879(683*13). 

This example shows two rounds of mismatch selection treatment but this 
treatment can be continued until desired result is obtained. 
This enrichment factor obtained with the mismatch selection treatment can 
25 then be multiplied with the initial enrichment factor obtained with the standard 
selection procedure to obtain the overall enrichment factor for the complete 
selection process. 
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EXAMPLE 6. General experimental procedure and material for multiple 
codon oligonucleotides 

5 Double stranded DNA species are formed by extension of oligonucleotide 

primer ER on single stranded templates EN 0 , EN 6 and EN 12 using Sequenase 
2.0 (Amersham Biosciences) and buffer. 

Heat-denaturation and annealing of samples: 
10 A mixture of double stranded EN 0 , double stranded EN 6 and double stranded 
EN 12 in 5 pi of water containing 40 mM HEPES pH 7.5, 50 mM NaCI, 16 mM 
MgCI 2 was heated to 95°C for 10 minutes and allowed to anneal by lowering 
the temperature. 

15 This treatment is expected to result in the re-formation of EN 0 -EN 0 , EN 6 -EN 6) 
and ENi2-EN 12 homodupiexes and the formation of EN 0 -EN 6 , EN 0 -ENi 2) ENe- 
EN 6 , EN 6 -ENi 2 and ENi 2 -EN 12 heteroduplexes (containing 1 or more 
mismatched base pairs). 

20 Treatment of samples with mismatch-specific enzyme: 

To the 5 pi mixture was added 1 pi (10x) Surveyor reaction buffer, 0.5 pi 
Enhancer, 1 pi Surveyor Nuclease, and 2.5 pi H20. The samples were then 
mixed and incubated 1 hour at 42°C. Then the sample were heated to 95°C 
for 10 minutes and allowed to anneal by lowering the temperature. To 3 pi of 

25 each sample was added 1 pi 5xEXT buffer [100 mM HEPES pH 7.5, 750 mM 
NaCI, 40 mM MgCI 2 ], 4 pi H 2 0, and 1 pi E. coli Exonuclease VII (10 U/pl) 
(USB). The samples were then incubated at 37°C for 16 hours. 

Then the samples were run on a denaturing 10 % polyacrylamide gel and a 
30 gel slice bands containing full-length templates (EN 0 , EN 6 and EN 12 ) 
[estimated using 32P-Iabeled marker oligos of the same length as the 
templates) were excised from the gel and placed in an eppendorph tube. 100 
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L - Mi of 1 x EXT buffer [40 mM HEPES pH 7.5, 50 mM NaCI, 16 mM MgCI 2 ] was 

added. DNA was liberated from the gel by freeze-thawing using 2 cycles of 
heating to 99°C and cooling to -20°C. 400 pi H 2 0 was added and 5 pi DNA 
mixture was used for PCR amplification with 5 pmol each of primers EF and 
5 ER in a 25 pi reaction using Ready-To-Go beads (Amersham Biosciences). 

A TOPO-TA (invitrogen) ligation reaction was assembled with 4 pi PCR 
product, 1 pi salt solution (Invitrogen), and 1 pi vector. Water was added to 6 
pi. The reaction was incubated at RT for 30 min. Heat-shock competent 

10 TOP10 E.coli ceils were thawed on ice. 5 pi ligation reaction was added to 
the thawed cells and these were incubated 30 min on ice, heatshocked in 
42°C water for 30 sec, and then put on ice. 250 pi of growth medium was 
added and the mixture was incubated 1 h at 37oC. The mixture was then 
spread on a growth plate containing 100 pg / m! ampicillin and incubate ON 

15 at 37oC. Individual E.coli clones were picked and transferred to PCR wells 

containing 50 pi water. These 50 pi were incubated at 94°C for 5 minutes and 
used in a 20 pi in a 25 pi PCR reaction with 5 pmol of each TOPO primer 
M1 3 forward & M13 reverse and Ready-To-Go PCR beads (Amersham 
Biosciences). The following PCR profile was used: 94oC 2 min, then 30 x 

20 (94oC 4 sec, 50oC 30 sec, 72oC 1 min) then 72oC 10 min. Primers and 

nucleotides were degraded by adding 1 pi 1:1 EXO/SAP mixture (USB corp.) 
to 2 pi PCR product and incubating at 37°C for 15 min and then 80°C for 15 
min to heat-inactivate the enzymes. 5 pmol T7 primer was added and water 
was added to 12 pi. Add 8 pi DYEnamic ET cycle sequencing Terminator Mix 

25 (Applied biosystems). A thermocycling profile of 30 x (95oC 20 sec, 50oC 15 
sec, 60oC 1 min) was run. Then 5 pi water was added to each well and 
sequencing reactions were purified using seq96 spinpiates (Amersham 
Biosciences). Reactions were run on a MegaBace capillary electrophoresis 
instrument (Molecular Dynamics) using injection parameters 2 kV, 50 sec 

30 and run parameters: 9 kV 45 min and analyzed using Contig Express 
software (Informax). 
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Oligonucleotides 
EN 0 oligo: 

5 5'GTCGAATGCTGTAGCGGTAGGCAGC AATGA CGTCG AATGAC AGCAAA 
TGAGTCGATGTGCTGAGCTAGAT-3' 

EN 6 oligo: 
5'- 

1 0 GTCGAATGCTGTAGCGGTAGGCAGC ANNGA CGTCG NANGA CAGC ANAT 
NAGTC 

G ATGTGCTGAG CTAG AT -3' 

EN 12 oligo: 
15 5'- 

GTCGAATGCTGTAGCGGTAGGCAGC NNTNN CGTCG ANNNN CAGC ANNN 
GNGTC 

GATGTGCTGAGCTAGAT-3' 

20 EF oligo: 

5-GTCGAATGCTGTAGCGGTAG 

ER oligo: 

5 '-ATCTAGCTC AG C AC ATCG AC 

25 

M1 3 forward: 

5'-GTAAAACGACGGCCAG 
Ml 3 reverse: 

5'-CAGGAAACAGCTATGAC 
30 T7 primer: 

5 '-TAATACG ACTCACTATAG GG 
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EXAMPLE 7: Mismatch selection treatment and composition of 
oligonucleotide containing multiple codons. 

Libraries of bifunctional complexes will preferably contain more than one 
codon allowing encoding of multiple functional entities in the displayed 
5 molecule. This example describes an identifier containing three variable 

regions that represent three individual encoding codons. In the homo- hetero- 
duplex separation step in the mismatch cleavage procedure multiple short 
fragments are produced that could potentially recombine through overlapping 
sequences and generated shuffled variants. These shuffled identifiers would 
10 then contain codons originated from different original identifiers. This is 
tested in this example. 

Selected libraries with identifier oligos containing 3 codons was modelled 
using templates EN 0> EN 6 , and EN 12 . 
15 It is expected that a library initially containing 4 A 1 2 = 1 ,6e7 different species 
will contain an enriched best binder EN 0 and a pool of not-best binders EN 6 . 
The enriched pool of not-best is expected to be diverse (in this case the EN 6 
pool contains 4 A 6 = 4096 species but not as diverse as the background of 
non-specific binders (EN 12 contains 1,6e7 different species). 

20 

Two samples were mixed to model selected libraries: 
Sample 3.1A 

1 pmol ENi2, 0.1 pmol EN 6 , 0.1 pmol EN 0 . This corresponds to a 1,6e7 
25 member library that has been selected so that the best binder (EN 0 ) has 

been enriched 1,6e6 fold and the not-best binders (EN 6 ) have been enriched 
390 fold (1,6e6/4096). 

30 Sample 3. 1B 

1 pmol EN12, 0.01 pmol EN 6) 0.01 pmol EN 0 . This corresponds to a 1,6e7 
member library that has been selected so that the best binder (EN 0 ) has 
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been enriched 1,6e5 fold and the not-best binders (EN 6 ) have been enriched 
39 fold (1,6e5/4096). 

The samples were subjected mismatch selection as described in Example 6. 

5 



Sample 3.1A 





EN 12 


EN 6 


ENo 


Input (sequencing) 


11 


2 


2 


Output (sequencing) 


5 


0 


6 



This corresponds to an ENo enrichment fold of (6/1 1 )/(2/(2+2+1 1 )) = 4 

INPUT 1 No : 1 N 6 : 10 N12 

» F01 
r> B02 
» G02 
s> DQ1 
» E03 
» E01 
» E02 
» F02 
2> CQ1 
» F03 
» A03 
» K03 
* AQ2 
» D03 
» C04 



» A05 
» B05 
» DOS 
» AO 6 
a> AO 6 
» COS 
» B06 
» E06 
> G06 
» COS 
» H06 



( 1) GCT CTAGCGGTAGGCAG^^^|CGTa^p^pCAGCA^^^GTCGAT GT GCT GAGCTAG 

( 1) GCT GT AGC G CT AGGCAGCi^^i^CGT CG^^^^ICAGCT ^^^^^GTCGAT GT GCT GAGCTAG 

(32) GCT GTAGCGGT AGGCAGC^^^CGTCG^^KcAGCA^^^pGTCGAT GT GCT GAGCTAG 

(32) GCT GTAGC GGTAGT CAGC AGSGAC GT CGJS^&C^GC^G^^GT CGAT GT GCT GAGCTAG 

(3 2 ) GCT GTAGCGGT AGGCAGCTATACCGT CGA;TCCACAGCL\GGAGGGTCGAT GT GCT GAGCTAG 

(2 9 ) GCT GTAGC GGT AGGCAGCCATAAC GT CGACGAC CAGCAGCAGAGT CGAT GT GCT GAGCTAG 

(32) GCT GTAGCGGT AGGCAGCTTT TCCGT CGATGGT CAGCAGCAGGGTCGAT GT GCT GAGCTAG 

(32) GCT GTAGC GGT AGGCAGCGGT CTCGT CGATGTACAGCAAAAGAGT CGAT GT GCT GAGCTAG 

(53 ) GCT GTAGC GGTAGT GCAGCCGTACGT CGA^tiAGCAGC ACAGAGT C GAT GT GCT GAG CT AG 

(32) GCTGTAGCGGT AGGCAGCCAT.TGCTT CGAGC CAC AG CAAGG GGGT C GAT GT GCT GAGCTAG 

(32 ) GCT GTAGCGGT AGGCAGCAATAGCGT CGAGGGACAGCAGGCGGGT C GAT GT GCT GAGCTAG 

(32) GCT GTAGC GGT AGGCAGCGGT ACC GT CGAATCAC AGCACCTGAGT C GAT GT GCT GAGCTAG 

(32) .GCT GTAGCGGT AGGCAGCGGT AGCGT CGAAG^GACAGCAGAI .GAGTCGATGT GCT GAGCTAG 

(3 2 ) GCT GTAGC GGT AGGCATCCGTAGCGT CGAAAGT CAGCAGGGSAGTCGAT GT GCT GAGCTAG 

(32) GCT GTAGC GGT AGGCAGCGGTAT C GT CGA^^CAGCAG^GGGT CGAT GT GCT GAGCTAG 




OUTPUT 1 No : 1 N 6 : 10 

(3 3 ) GCT GTAGCGGT AGGCAGC^^pCGTCG^^|CAGCAp^^GT CGAT GT GCT GAGCTAG 

(2) (33) GCT GT AGC GGT AGGCAGCSS^^KCGT CG^^^|cA— CASK/CT^fitGT CGAT GT GC T GAGC T AG 

(33) GCT GTAGC GGT AGGC AGC^^^^CGT C(^^^^^CAGCA^^^^iGT C GAT GT GCT GAGCTAG 

(2) (35) GCTGTAGCGGTAGGC^GCfl^ 

(33) GCT GTAGCGGT GGGCLAGCg^SCGTCG^^KCAGCA^^KAGT CGAT GT GCT GAGCTAG 
(29) GCT GTAGCGGT AGGC AGC^^McGT CGj^^CAACA^g^GTCGAT GT GCT GAGCTAG 

GCT GTAGC GGT AGGC AGC GAT TACGT CGACAGT CAG CACAAGG GT CGAT GT GCT GAGCTAG 

(34) GCT GTAGC GGT AGGC AGC AGT CTCGT CGAT GGC CAGCAGGAGAGT CGAT GT GCT GAGCTAG 
(2) (34) GCT GTAGC GGT AGGGAGC CT iXACGT CGi^C(fecAGCAGAA<feGT C GAT GT GC T GAGCT AG 
(2) (28) GCT GTAGCGGT AGGCAGCAATAGCGT CGAT A^GCAGCAAGAGCGT CGAT GT GCT GAGCTAG 
(2) (34) GCT GT AGCGGT AGGCAGC AGT AC CGT T GIAAC j^CAGXllAGGCGAGT C GAT GT GCT GAGCTAG 



Sample 3.1B 





EN 12 


EN 6 


EN 0 
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Input (theoretical) 


100 


1 


1 


Output (sequencing) 


6 


3 


1. 



This corresponds to an EN 0 enrichment fold of (1/(6+3+1 ))/(1/1 00) = 10 
and an EN 6 enrichment fold of (3/(6+3+1 ))/(1/1 00) = 30 

INPUT 1 N 0 : 1 N 6 : 100 N 12 



Ir 



» F03 
y> GO 3 
» A03 
y> C04 
» HQS 
* E01 
» D02 
» G01 
» A02 
i> D04 



(31) 
(29) 
(30) 

(1) 
(29) 
(30) 

(1) 
(29) 
(25) 
(30) 



OUTPUT 1N 0 :1N 6 : 100 



GCTGTAGCGGTAGGCAGC$ 



&CGTC 



.^J^CAGCA^p^GTCGATGTGCTGAGCTAG 
GCTGTAGCGGTAGGCAGC&GT^CGTCG^ 
GCT GT AGCGGT AGGCAGCi^^^CGT CGC^G^Cl^GOW^-.^GTC GATGT GCT GAGCTAG 
GCT GT AGCGGT AGGCAGC^^ffiCGT CG§^B|cAGCA^§|kSGTC GAT GT GCT GAGCTAG 
GCT GT AGCGGT AGGCAGCAAT AGCGT CGJ^T ACCAGCAGCT GAGTCGAT GT GCT GAGCTAG 
GCT GT AGCGGT AGGCAGCGAT iSACGT CGAGGGGCAG CAAAGGAGTC GAT GT GCT GAGCTAG 
GCT GT AGCGGT AGGCAGCAGTCACGT CGAGAAG CAGCAGGAGAGTC GAT GT GCT GAGCTAG 
GCT GT AGCGGT AGGCAGCGGTGACGT CGAAGTGCAGTA^AGGGTCGATGT GCT GAGCTAG 
GCT GT AGC GGT AGGCAGT AATGCCGT CGATCGACAGC^CGG GAGTCGAT GT GCT GAGCTAG 
GCT GT AGC GGTAGGCAGCCTT AGCGT CGMT TGC^GCAGACGCGTCGAT GTGCT GAGCTAG 



These two results shows that enrichment of the expected sequences (N 0 , N 6 ) 
are identified by sequencing and the composition of the identifier sequences 
are kept intact. There is no shuffling between the codons from the N 0 , N 6 , N 12 
1 0 oligonucleotides. 



Example 8 : Identification of a liqand from a library composed of 61875 
different small molecules each associated with a unique identifer 
15 oligonucleotide by selection and subsequently using mismatch selection 
(MISE). 



General arrangement of each complex composed of display molecule and 
identifier oligonucleotide in the library generation: 

20 
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/ \ O hgo Ax 



98 

Oligo Bx 



Oiigo Cx 



\ 

HN 



Oligo a 



Oligo bx 



)r-R c -NH 2 



Oligo cx Oligo d 



5* 



Overview of the library generation procedure: 



First round of library generation (round A) : 




Building block A x (BB A x) 



1 0 H 2 N~5'-template 



i H / 

J Pnt-N-R A -^ 



HN-5'-template 



H 2 N-R A -^ 

HN— 5-tempiate 



"Pnf corresponds to pentenoyl - an amine protecting group. "R" can by any 
molecule fragment. The chemical used in library generation comprise a 
15 primary (shown) or a secondary amine. 



Second round of library generation (round B) : 



20 



H 2 N-Rb-^ 

HN-5'-template 




^ Building block B x (BB B x) 



HN-R A -^ 



HN-5'-tempIate 



NH 
Pnt 



HN-R A -^ 

HN-5Memplate 

NH 2 
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Third round of library generation (round C): 



m Building block C x (BB c x) 







OH 



HN— 5-template 



HN-5'-temptate 



HN-5'-temp!ate 



HN 




HN 

^~Rc-NH 2 



O 



O 



5 



General procedure : Library generation, selection and mismatch subsequent 
selection 

10 First round of library generation (round A) : 

First oligonucleotides of the A series are each modified by adding to each 
type of oligo a small molecule building block (BB A x) to the 5' amine forming 
an amide bond. After this step the template is comprised of oligo Ax. 

15 Second round of library generation (round B) : 

4 nmol of a mixture of different modified A oligos are then split into a number 
tubes corresponding to the number of different building blocks to be used in 
round B. 190 pmol Oligo a and 2 pi heering DNA is added to each tube and 
the DNA material in each tube is lyophiiized. The lyophilized DNA is then 

20 redissolved in 50 pi water and purified by spining through Biospin P-6 
columns (Biorad) equilibrated with water. 

Addition of building block: 

The DNA material in each tube is again lyophilized and redissolved in 2 pi 
25 100 mM Na-borate pH 8.0/100 mM sulfo N-hydroxy succinimide (sNHS). For 
each tube 10 pi building block BB B x (100 mM in dimethyl sulfoxide [DMSO]) 
is preactivated by mixing with 10 pi 1-Ethyl-3-(3-dimethylaminopropyl)- 
carbodiirnide (EDC) (90 mM in dimethylformamide [DMF]) and incubating at 
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30°C for 30 min. 3 pi of this preactivated mixture is then mixed with the 2 |jl in 
each tube and allowed to react 45 min at 30 °C. Then an additional 3 pi 
freshly preactivated BB is added and the reaction is allowed to proceed for 
45 min at 30 °C. The resulting mixture is then purified by spinning through 
5 Bio-Rad P6 DG (Desalting gel). 

Addition ofcodon oligonucleotide : 

The DNA material is then lyophilized and redissolved in 10 pi water 
containing 200 pmol oligo Bx (eg. B1) and the corresponding oligo bx (eg. 

10 b1). This is done so that the codon in oligo Bx identifies the BB B x added to 
the DNA template. 10 units of T4 DNA iigase (Promega) and 1 .2 (j| T4 DNA 
iigase buffer is then added to each tube and the mixture is incubated at 20°C 
for 1 hour. The DNA template linked to the small molecules now comprises 
an Ax oligo with a Bx oligo ligated to its 3' end. The reactions are then 

1 5 pooled, an appropiate volume of water is allowed to evaporate and the 

remaining sample is purified by spining through Biospin P-6 columns (Biorad) 
equilibrated with water. 

Removal of building block protecting group: 

20 The pooled sample (~ 50 pi) is adjusted to 10 mM Na-acetate (pH 5). 0.25 
volumes of 25 mM Iodine in tetrahydrofuran/water (1:1) is added and the 
sample is incubate at 37 °C for 2h. The reaction is then quenched by addition 
of 2 pi of 1M Na 2 S 2 03 and incubation at room temperature for 5 min. The 
complexes are then purified by spining through Biospin P-6 columns (Biorad) 

25 equilibrated with water 

To remove sulphonamide protecting groups, the sample is adjusted to 50 pi 
100 mM sodium borate pH 8.5 and 20 pi 500 mM 4-methoxy thiophenol (in 
acetonitrile) is added and the reaction is incubated at 25°C overnight. Then 
30 the complexes are purified by spinning through Biospin P-6 columns (Biorad) 
equilibrated with water and then lyophilized. 
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Third round of library generation (round C) : . 
The samples are dissolved in 175 pi 100 mM Na-borate pH 8.0 and 
distributed into 25 wells (7 pi / well). 2 pi 100 mM BB c x in water/DMSO and 1 
pi of 250 mM DMT-MM is added to each reaction and incubated at 30 °C 
5 overnigth. Water is added to 50 pi and the reactions are then spin purified 
using Bio-Rad P6 DG (Desalting gel) and subsequently water is allowed to 
evaporate so that the final volume is 1 0 pi. 

Addition of building block: 

10 The DNA material is then lyophilized and redissolved in 10 pi water 

containing 200 pmol oligo Cx (eg. C1) and the corresponding digo cx (eg. 
c1). This is done so that the codon in oligo Cx corresponds to the BB c x 
added to the DNA template. 10 units of T4 DNA ligase (Promega) and 1.2 pi 
T4 DNA ligase buffer is then added to each tube and incubated at 20°C for 1 

15 hour. The DNA template linked to the small molecules now comprises and Ax 
oligo with a Bx ligated to its 3' end and a Cx oligo ligated to the 3' end of the 
Bx oligo. The reactions are then pooled, the pooled sample volume is 
reduced by evaporation and the sample is purified by spining through Biospin 
P-6 columns (Biorad) equilibrated with water. The pooled sample (~ 50 pi) is 

20 adjusted to 10 mM Na-acetate (pH 5). 0.25 volumes of 25 mM Iodine in 

tetrahydrofuran/water (1:1) is added and the sample is incubate at 37 °C for 
2h. The reaction is then quenched by addition of 2 pi of 1M Na2S 2 0 3 and 
incubation at RT for 5 min. Then the DNA templates (carrying small 
molecules) are purified by spinning through Biospin P-6 columns (Biorad) 

25 equilibrated with water and then lyophilized. 

Final deprotection step 

Some building blocks contain methyl esters that are deprotected to acids by 
dissolving the pooled sample in 5 pi 20 mM NaOH, heating to 80 °C for 10 
30 minutes and adding 5 pi of 20 mM HCI. 

Final extension step 
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To ensure that the DNA templates are double stranded prior to selection 
oligo d is extended along the template by adding to the sample 10 pi of 5 X 
sequenase EX-buffer [100 mM Hepes, pH 7.5, 50 mM MgCI 2 , 750 mM NaCI] 
and 4000 pmoi oligo d. Annealing is performed by heating to 80°C and 
5 cooling to 20 °C. To the sample is then added 500 pL dNTP , water to 50 pi 
and 39 units of Sequenase version 2.0 (USB). The reaction is incubated at 
37°C for 1 hour. 

Selection 

10 This library is subjected to selection, whereby binders to the selection target 
are enriched. 

Maxisorp ELISA wells (NUNC A/S, Denmark) were coated with each 100 pL 
2pg/mL integrin aV(33 in PBS buffer [2.8 mM NaH 2 P0 4 , 7.2 mM Na 2 HP0 4j 
0.15 M NaCI, pH 7.2] overnight at 4°C. A ien the integrin solution was 

15 substituted for 200 pi blocking buffer [1^3, 0.05% Tween 20 (Sigma P-9416), 
1% bovine serum alnumin (Sigma A-7030), 1 mM MnCI 2 ] which was left on 
for 3 hours at room temperature. Then the welis were washed 10 times with 
blocking buffer and the encoded library was added to the wells after diluting it 
100 times with blocking buffer. Following 2 hours incubation at room 

20 temperature the wells were washed 10 times with blocking buffer. After the 
final wash the wells were cleared of wash buffer and subsequently inverted 
and exposed to UV light at 300-350 nm for 30 seconds using a trans- 
illuminator set at 70% power. Then 100 pi blocking buffer without Tween-20 
was immediately added to each well, the wells were shaken for 30 seconds, 

25 and the solutions containing eluted templates were removed for PCR 
amplification and then used for mismatch selection (MISE). 

Mismatch selection (MISE) : 

The double stranded sample is denatured by heating and allowed to cool 
30 whereby hetero- and homoduplexes are formed: 



Treatment of samples with nucleases: 
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To the 5 pi mixture ia added 1 pi (10x) Surveyor reaction buffer 
(Transgenomic), 0.5 ul Enhancer (Transgenomic), 1 ul Surveyor Nuclease 
(Transgenomic), and 2.5 pi H 2 0. The samples are then mixed and incubated 
1 hour at 42°C. Then the sample is heated to 95°C for 10 minutes and 
5 duplexes are allowed to form by lowering the temperature. To 3 pi of each 
sample is added 1 pi 5 x EXT buffer [100 mM HEPES pH 7.5, 750 mM NaCI, 
40 mM MgCI 2 ], 4 pi H 2 0, and 1 pi E. coli Exonuclease VII (10 U/pl) (USB). 
The samples are then incubated at 37°C for 16 hours. 

10 Purification by polyacrylamide gel electrophoresis: 

Then the samples are mixed with loading buffer and run on a denaturing 10 
% polyacrylamide gel and gel slices containing full-length templates 
(estimated using 32P-labeled marker oligos of the same length as the 
templates) are excised from the gel and placed in an eppendorph tube. 100 

15 pi of 1 x EXT buffer [40 mM HEPES pH 7.5, 50 mM NaCI, 16 mM MgCI 2 ] are 
added. DNA is liberated from the gel by freeze-thawing using 2 cycles of 
heating to 99°C and cooling to -20°C. 400 pi H 2 0 is added and 5 pi DNA 
mixture is used for PCR amplification with 5 pmol each of forward and 
reverse primers and in a 25 pi reaction using Ready-To-Go beads 

20 (Amersham Biosciences). 

Cloning of MISE products 

A TOPO-TA (Invitrogen) ligation reaction is assembled with 4 pi PCR 
product, 1 pi salt solution (Invitrogen) and 1 pi vector. Water is added to 6 pi. 

25 The reaction is then incubated at RT for 30 min. Heat-shock competent 

TOP10 E.coli cells are then thawed on ice and 5 pi of the ligation reaction is 
added to the thawed cells. The cells are then incubated 30 min on ice, 
heatshocked in 42°C water for 30 sec, and then put on ice again. 250 pi of 
growth medium is added to the cells and they are incubated 1 h at 37°C. The 

30 medium containing cells is then spread on a growth plate containing 100 pg / 
ml ampicillin and incubated at 37°C for 1 6 hours. 
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Sequencing of MISE products: 

Individual E.coli clones are then picked and transferred to PCR wells 
containing 50 pi water. These 50 ul were incubated at 94°C for 5 minutes and 
used in a 20 ul in a 25 pi PCR reaction with 5 pmol of each TOPO primer 

5 M1 3 forward & M1 3 reverse and Ready-To-Go PCR beads (Amersham 

Biosciences). The following PCR profile is used: 94°C 2 min, then 30 x (94°C 
4 sec, 50°C 30 sec, 72°C 1 min) then 72°C 10 min. Primers and nucleotides 
are then degraded by adding 1 ul 1:1 EXO/SAP mixture (USB corp.) to 2 pi 
PCR product and incubating at 37°C for 15 min and then 80°C for 15 min to 

10 heat-inactivate the enzymes. 5 pmol T7 primer is added and water is added 
to 12 pi. Then 8 pi DYEnamic ET cycle sequencing Terminator Mix (Applied 
biosystems) is added to each well. A thermocycling profile of 30 x (95°C 20 
sec, 50°C 15 sec, 60°C 1 min) is then run. Then 10 pi water is added to each 
well and sequencing reactions are purified using seq96 spinplates 

15 (Amersham Biosciences). Reactions are then run on a MegaBace capillary 
electrophoresis instrument (Molecular Dynamics) using injection parameters 
2 kV, 50 sec and run parameters: 9 kV 45 min and analyzed using Contig 
Express software (Informax). 
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Fxample 8.1 (general procedure 8): Generatin g of a library/ containing 61875 
different small molecules and identification of a b inder among these bv 
subjecting the library to target selection and su hseouent mismatch selection 
(MISE) : 

5 

F/rs'f round of library generation (round A) : 

99 different A oligos were used : 

1 0 Oligo Ax genera! structure : 

5'- NSP-ACCTCAGCTGTGTATCGAGCGGCAGCTGTTCCGTCG-3' 

The underlined part corresponds to the 5 nucleotide sequence that varies 
1 5 among different A oligos, ie. the codon. The remaining sequence is identical 
among the A oligos. 

N : 5'-Amino-Modifier 5 (Glen research cat# 1 0-1 905-90) 
S : Spacer C3 CPG (Glen research cat# 20-2913-01) 
20 P : PC Spacer Phosphoramidite (Glen research cat* 10-4913-90) 

Oligo a: 5'- TGTG CG ACGI 1 1 1 1 GCTG CCG CTCG ATACACAG CTG AGGT 
I : inosine 

Onto the free NH 2 of the 5' amino modifier on these oligos were loaded the 
listed BB A x building blocks: 

25 



30 
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Oligo A7 


ACCTG 


BB A 7 


HCL 


^ 3\ 


Oligo A8 


CTGGT 


BB A 8 


nrV 


Ok 


Oiigo A9 


TAGGA 


BB A 9 


o 

\ OH 


Oligo A10 


ACTCA 


BB A 10 




Oligo A11 


CTTAC 


BB A 1 1 


> V NH \==/ 


Oligo A12 


CGCAC 


BB A 12 




Oligo A13 


TCGCG 


BB A 13 
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Oligo A14 


CGGAT 


BB A 14 




Oligo A15 


GAGAT 


BB A 15 




Oligo A16 


TGTAG 


BB A 16 


O l N 


Oligo A17 


GTGTT 


BB A 17 


HO 




AGATG 


BB A 18 


HO ^ 


Oligo A19 


ATCCT 


BB A 19 


0 ^^^^ 
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Oliao A20 
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Oiigo A21 
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10 



Oligo A26 


TGCTT 


BB A 26 


\f 

H 


Oiigo A27 


TTAGC 


BB A 27 




Oligo A28 


GCTGA 


BB A 28 




Oligo A29 


GAACG 


BB A 29 


OH > f 


Oligo A30 


CATGG 


BB A 30 





30 
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Oligo A31 


TGGTA 


BB A 31 




y — oh 
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Oiigo A32 
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BB A 32 
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Oligo A35 


ACTAG 
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10 



Oligo A36 


TACCT 


BB A 36 






Oligo A37 


TACGA 


BB A 37 




_> 


Oiigo A38 


CTTCA 


BB A 38 


/ 


y . 


Oligo A39 


CTCTT 


BB A 39 






Oiigo A40 


TCATC 


BB A 40 
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Oligo A42 
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BB A 42 
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HN 
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1 Oligo A44 
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Oligo A45 
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Oligo A46 


TAACA 


BB A 46 




Oligo A47 


TAACA 


BB A 47 




Oligo A48 


CCAGG 


BB A 48 




Oligo A49 


ATGTC 


BB A 49 
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Oligo A50 
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Oligo A51 
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Oligo A52 


GACTT 


BB A 52 




I || ] 




Oligo A53 


GGTGG 


BB A 53 




.OH 


0 

I 


Oiigo A54 


CAACT 


BB A 54 


o 

HO \ 

^0 


I 


Oiigo A55 


ATGAG 


BB A 55 


o 


Oligo A56 


TCTGC 


BB A 56 


HO ^ 


I 
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Oligo A57 


ATAGG 


BB A 57 


HO-/ 




Oligo A58 
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BB A 58 
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Oligo A63 


AATCG 


BB A 63 


J F 


Oligo A64 


AAGCT 


BB A 64 


\ u 


Oligo A65 


CCGAA 


BB A 65 


\ ff 


Oligo A66 
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Oligo A67 
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Oligo A69 


TGAGG 
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Oligo A74 


AGCAA 


BB A 74 


HO 


Oligo A75 


TTCCA 


BB A 75 


HCL 


> ^^^^^^^^^ 


Oligo A76 ! 


AGACT 


BB A 76 
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BB A 77 


HO. 

C 
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Oligo A79 


AACGT 


BB A 79 
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Oligo A98 
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20 

Second round of library generation {round B) : 

25 different combinations of BB B x and [oiigo Bx-oligo bx] pairs were used. 

25 Onto the free NH 2 -group of the above loaded and deprotected BB A x building 
blocks were loaded the below listed BB B x building blocks: 

Oligo Bx general structure : 
30 5'- HPOs-CACA AGTACGAACGTGCATCAGAG -3' 
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The underlined part corresponds to the 20 nucleotide sequence that varies 
among different B oligos, ie. the codon. The remaining sequence is identical 
among the B oligos. 

The corresponding b oligos used have the general structure : 
5-HPO3- TCCT CTCTGATGCACGTTCGTACT 

Every b oiigo can anneal to a specific B oligo. As can be seen the above 
shown B oligo can anneal to the above shown b oligo. 
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Third round of library generation (round C) : 
5 25 different C oligos were used : 
Oligo Cx general structure : 

5'- HPOs-CACA AGTACGAACGTGCATC AGAG-3' 

10 

The underlined part corresponds to the 20 nucleotide sequence that varies 
among different C oligos, ie. the codon. The remaining sequence is identical 
among the C oligos. 

1 5 Onto these oligos were loaded the listed BBcx building blocks: 
Oligo Cx example : 5'-HP0 3 - 

AGG ACGAGCAGGACCTGGAACCTGGTGC GTTCCTCCACCACGTCTCCG 
Oligo cx example : 5'- GCACCAGGTTCCAGGTCCTGCTCG 

20 



25 



30 
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Final extension 

Oligo d: S'-CGGAGACGTGGTGGAGGAAC 

Results of sequencing the identifier oligonucleotides that result from the 
target selection 

5 

Fraction of templates encoding any binder before selection (estimated): 
1 in 61.875 

Sequenced identifier oligonucleotides after selection (codons corresponding 
10 to small molecule X are shown in bold italics): 



» G052.abd (53) 

CGGCAGCCATGGCGTCGC^CAATCTTGGTTGTTCTCCTGCGAGGACC^TCTCGAACGACCTGGGGGTT 
» F012.abd (53) 

15 CGG CAG CACGATCGTCG CACAGGAACCATCAAGATCCTGA CGTT 

» E02.abd (53) 

CGGCAG CAACTACGT CG CACATAGTCTCCT CCACTTC CATGAGGACC ATCTCGACGAC CTACTC CTGGG GTT 
» E032.abd (52) 

CGGGTGCCAAGGTGGTGGTCGTCGTCTCGCAGGAGAACAACCAAGATTGTGCGACGGCTAAGCTGCCGCT 

20 » E03.abd (53) 

CGGCAGCACACCCGTCGCACATTCACTCCATGCAGCATGCGAGGACCTGACACTGGTCGTGGTCGAGGCGTT • 

» H022.abd (52) 

CGGCAGCGTACTCGTCGCAC^TCAAGGTTGGTGGTGTAACTGAGGACTCGACCACTGCAGGTGGAGTTCCGTT 
» EOl.abd (53) 

25 CGGCAGCTTTGTCGTCGCACATCGAACTITGTTGCTTCCTCGAGGACCATCTCGACGACCTGCTCCTGGGGTT 
» H042.abd (52) 

CGG C AG CCTTACCGTCG CAC ATC CAGTGCAAGACTGAACAGAGGACCTGGTTGTCG AGGTGAG C AGCAGCGTT 
» F042.abd (53) 

CGG CAG CCTTACCGTCG CAC ATC AACAACTTGGTTCTTGGGAGGACCACGTCCACAGGTG CAC CAG GTGGTT 

30 » EOS.abd (53) 

CGGCAGCCCTCGCGTCGCACATAGTCTCCTCCACTTCCATGAGGACCTGGTGCGCCACGACGTGCTTCGGTT 

» E04.abd (52) 

CGG CAG CGTTATCGTCG CACATCCAGTG CAAG ACTG AACAGAGGACCACTG AG CTG CTCCT CC AGGTGGGTT 
» G032.abd (53) 

35 CGG CAGCTAACACGTGG CACATCCAGCTTCTAGGAAGACAGAGGACCTACCTCGTG CTCACGTCCTGCCGTT 

» F062.abd (52) 

CGGGTGCTGGTCGTCTGCAGC^TCGTCCTCTGCTAGTGCACTCAAGAAGTGTGCGACGGGAATGCTGCCGCT 
» H062.abd (53) 

CGGCAGCGGATACGTCGCACAAGTACGAACGTGCATCAGAGAGGTCGACCCTGCAGGTGGAGCTCCGTT 

40 » G062.abd (54) 

CGGCAGCAGCTCCGTCGCACATCCAGTGCAAGACTGAACAGAGGACAGCACCTGGAGGTAGGACCACGGGTT 
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» E022.abd (53) 

CGGCAGCTAAC^CGTCGCACATCGAACTTGTTGCTTC 
» E012.abd (53) 

CGGGTGCTGGTCGTCTGCAGC^TCGTCCTCTGCTAGTGCACTCAAGAAGTGTGCGACGGGAATGCTGCCGCT 

5 » H012.abd (53) 

CGG CAGCATTCCCGTCG CAC&CTTCTTGAGTGCACTAGCAGAGGA.CGATG CTGCAGACGACCAGCA CCCGTT 

» E062.abd (53) 

CGCGACCAGGATGGACCTCGTCGAGTCCTCTGTTCAGTCTTGCACTGGATGCGACGGCACTGCTGCCGCT 

» F032.abd (53) . 
10 GGAA cCTGGACAGTTGGAGACCTCGTGGTCCTCG^ 
» F022.abd (52) 

CGGCAGCCGCACCGTCGCACAGCATCTTGAGTCGTO 
». H052.abd (53) 

CGGCAGCACAC C CGTCG CACAGG AACATCAAAG ATCCTGAGAGGACCATCTCGACG AC CTG CTC CTGG GGTT 

15 » G022.abd (53) 

CGGCTGACGAGGTCCTCGAACTGGTCCTCACCTAGTAGGAACGTCCTTTGTGCGACGAGTTGGCTGCCGCT 

» G042.abd (53) 

CGGCAGCTCTGCCGTCGCACACTTCTTGAGTGCAC 
» E042-abd (55) 

20 CGCTGCTGCTCACCTCGAC^CCAGGTCCTCCTTGGATCTCGTCAGAGATTGTGCGACGGCTCGGCTGCCGCT 
» E052.abd (54) 

CGGCAG C CACTCCGTCG CAC ACTGAGTGTGTAGT ACCAACGAGGACGAG CACG AGG AG CACGTGTC CAGCGTT 
» G012.abd (54) 

CGGCAGCGTTCCGTCGCACATTCAGTGCTTGAGCTAACTGAGGACACTCGT^GATGATCCTGCTACCGTyGGGTT 

25 » H032.abd (53) 

CGGCAGCATCCTCGTCGCACATAGTAGCTTGGTACGTATGACCGAGGACCACAGAAGGTCT 

Fraction of identifiers encoding binder X after selection before MISE 
treatment: 1 out of 28. Enrichment fold in selection : (1/27)/(1/61 .875) = 2210 
30 fold (theoretical maximum is 61 875 fold). 

Results of sequencing the identifier oligonucleotides that result from 
mismatch selection (MISE) (codons correspondi ng to small molecule X are 
shown in bold italics): 

35 

» MISE5 rl40 (69) GGCAGCArrCCCGGTCGCACACrrcrrGAGTGCA- 
CTAGCAGAGGACGATGCTGCAGACGACC&TGCACCCGTTC 

« MISE5_rll2 (192) GGCAGCArTCCCGTCGCACACrrCTTGAGrGCfl- 
40 CTAGCAGAGGACGATGCTGCACSACGACCAGCACCCGTTC 

« MISE5_rll7 (183) GGCAGCTATrCCACTGTCGCTACACrrC2TGrAGTGCA- 
CTAGCAGAGGACGArGCrGCArACAGACCAGCACCCGTTC 
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« MISE5_rll5 (201) 

G G CAG CArrCTCT CGTCG CA<^CTTttT(&GTGCAGCTAGC!AGAGGATCGATGCTGCATGA CGATCCAGCACCCG T 
« MISE5_rlll (194) 

G CTAG CA rrCG CCGTTGCZlC&CTTCTTGAGTGCAGTAGCAGAGG&CGATG ctg cagacgagccagca cccgttc 
5 « MISE5_rl48 (232) , 

GGCAGCATrCCCGTCG CACACTTCTTGAGTGCATTAG CAG AG GAC GA TGCTGCAGAGCGA CCAGCACCCGTTC 

« MISE5_rl23 (208) 

G GCAG CGTTTCG CGTGCG CACACTTCGTTGAGTGCAATCTAG CAGAGG ACTGATGCTG CTAGACGACCAGCACCCGTT 
« MISE5_rl37 (212) 

1 0 GGCAG CA!TrCCCGTCGCACACrircrr(?AGrGCAG!CAGCAGrAGGAC(^ TGCTGCA1SA CGAGCCAGCACCCGTTC 

« MISE5_rl44 (228) 
GGCTVGCArrCCACGTCGCTACACrr^ 
« MISE5_rl32 (215) 

TGGCAGCATTCCCGTCG C ACACT!r?07rGAGTGCACTAGCATGAGGATCGA TGCTGCAGAGCTACCAGCACCCGTT 
/ --, 15 « MISE5_rl26 (235) 

( ) GG CAGCATGTCCCGTCGCTACGCTTCTTGAGTGCA TCIEAGGCAGAGG ACGATGGGCTGCAGA CGACCAGCA CCCGTT 

« MISE5_rl05 (247) 
GGCAGCATTCCCGTCGCAC^CrrrCIT^^ 
« MISE5_rl31 (298) 

20 GGCAGCATITCCCGTCGCAC ACTrcrTGAGTGCACrAGCAGAGGACGATGCrGrGCAGA CGACCAGCA CCCGTT 



Fraction of templates encoding binder X after MISE treatment : 12 out of 13 
Enrichment fold : (12/13)7(1/28) = 26 (theoretical maximum is 28 fold). 



25 



The output of the selection process shows 26 different sequences. Thus, it is 
not possible to rank the corresponding display molecule in accordance with 
their affinity towards the integrin target. The subsequent Mismatch Selection 
allows the clear conclusion that a single display molecule prevails over all 
30 others. 

EXAMPLE 9 : Using mismatch selection to identify thombin-binding ligands 
from a 1.105.920-member small molecule library. 

35 A library of 1 .105.920 small molecules displayed from identifier 

oligonucleotides was provided essentially as outlined in example 8 except 4 
rounds of library generation were performed (rounds 1 , B, C and D) each 
including addition of a building block and addition of the corresponding codon 
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oligonucleotide. The first, third and fourth additions of building blocks (rounds 
A, C and D) were performed using acylation reactions forming amide bonds 
whereas addition of the second building block (round B) was performed using 
reductive amination forming a carbon-nitrogen bond. The library included 1 0 
5 different building blocks at position 1 (round A), 48 building blocks at position 
2 (round B). 48 building blocks at position 3 (round C), and 48 building blocks 
at position 4 (round D). 

After library generation a fraction of the library contacted with a target. In the 
1 0 present example the target is the protein Thrombin. However, any target may 
be used. 

The unbound library fraction was removed by washing and the remaining 
bound fraction was recovered from the target by specific elution, i.e., 

1 5 displacement of the target-bound library fraction from the target by addition of 
a specific high affinity thrombin binding molecule at an appropiate 
concentration. The eluted library fraction (hereafter referred to as the output) 
was then purified by spin column -filtration. The process of contacting the 
library with thrombin followed by elution (this process is hereafter referred to 

20 as pseudorounding) was repeated a 4 times. A fraction of the eluted library 
from pseudoround 4 was either a) applied to a further pseudoround 
(pseudoround 5) or b) subjected to mismatch selection conditions. 

Among the 1.105.920 display molecules in the library are molecules that 
25 correspond to the building block combinations BB572-BB1023-BB1025 and 
BB479-BB1 023-BB1 025 both correspond to thrombin binders, e.g., the Kd 
(equilibrium dissociation constant) of the binding between thrombin and a 
small molecule derived from the BB479-BB1023-BB1025 building block 
combination was determined to be 1.1 uM using surface plasmon resonance 
30 technology (Biacore). 
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PCR amplification of a fraction of the library before pseuodoround 1 and 
outputs from pseudoround 4 and pseudoround 5 was performed followed by 
cloning and sequencing as described in example 8. The fraction of identifier 
oligonucleotide sequences codons corresponding to the mentioned building 
5 block combinations BB572-BB1023-BB1025 and BB479-BB1023-BB1025 
are shown in table 9.1. 

TABLE 9.1 : Relative abundances of building blocks as defined by their 
1 0 corresponding identifer oligonucletide codons 



Building block 
(Position) 


Library 
before 

pseudoround 
1 


Pseudoround 4 
output 


Pseudoround 5 
Output 


Output of 

mismatch 

selection 

performed 

on 

pseudoround 
4 output 


BB479 (2) 


0.01 


0.09 


0.13 


0.16 


BB572 (2) 


0.01 


0.09 


0.10 


0.16 


BB1023 (3) 


0.01 


0.23 


0.31 


0.33 


BB1025 (4) 


0.03 


0.16 


0.20 


0.25 



The relative abundances of codons corresponding to building blocks which 
are part of thrombin-binding molecules in the pseudoround 4 output are 
significantly different than that of the library before pseudoround 1 (p = 0.028, 

1 5 student's t-test, 2-tailed, paired). This is in agreement with the expectation 
that pseudorounds will decrease the relative abundance of codons 
corresponding to building blocks which are not part of thrombin-binding small 
molecules and thereby increase the relative abundance of codons 
corresponding to building blocks which are part of thrombin-binding small 

20 molecules. 
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The output from pseudoround 5 was not significantly different that the output 
of pseudoround 4 (p = 0.19, student's t-test, 2-tailed, paired). 

This is not unexpected because after a number of pseudorounds the majority 
of the remaining library fraction will constitute display molecules with 
significant affinity for thrombin. As the abundance of a specific codon 
combination is positively correlated with the affinity of the displayed molecule, 
i.e., a high relative abundance of an identifier oligonucleotide indicates a 
significant affinity of its corresponding display molecule, it is of interest to 
identify the codon combinations with highest abundance. As the output 
furthermore contains a large number of binders the fraction of an individual 
binder will not change significantly once displayed molecules with low affinity 
for Thrombin have been removed by selection/ pseudorounding. Mismatch 
selection is a tool to circumvent this problem as it is useful for distinguishing 
between compounds represented with many templates (which will generally 
be binders of higher affinity) and compounds represented with few templates 
(which will generally be binders of relatively lower affinity). 

The mismatch selection strategy was applied to the PCR amplified output 
from pseudoround 4 using the following conditions: To 5 pi PCR product of 
pseudoround 4 was added 1 pi (10x) Surveyor reaction buffer 
(Transgenomic), 0.5 pi Enhancer (Transgenomic), 1 pi Surveyor Nuclease 
(Transgenomic), and 2.5 pi H 2 0. The reaction components were mixed and 
incubated for 2 hours at 42°C. Then the sample was heated to 95°C for 10 
minutes and identifier oligonucleotide duplexes were allowed to form by 
lowering the temperature. To 3 pi of the resulting sample was added 1 pi 5 x 
EXT buffer [100 mM HEPES pH 7.5, 750 mM NaCl, 40 mM MgCI 2 ], 4 pi H 2 0, 
and 1 pi E. coli Exonuclease I (10 U/pl) (USB). The reaction components 
were mixed and incubated at 37°C for 4 hours followed by enzyme 
inactivation by incubation at 95°C for 15 minutes. 
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PCR amplification of the output of mismatch selection on the output from 
pseudoround 4 was performed followed by cloning and sequencing as 
described in example 8. 

5 

The relative abundances of building blocks corresponding to display 
molecules with significant thrombin affinity in the output from mismatch 
selection performed on output from pseudoround 4 were significantly 
different that those of the output of pseudoround 4 (p = 0.035, student's t- 
10 test, 2-tailed, paired). 

Moreover, The relative abundances of building blocks corresponding to 
display molecules with significant thrombin affinity in the output from 
mismatch selection performed on output from pseudoround 4 were also 
1 5 significantly different that those of the output of pseudoround 5 (p = 0.0083, 
student's t-test, 2-tailed, paired). 

In conclusion, the relative abundance of codons corresponding to building 
blocks forming display molecules which bind thrombin were significantly 

20 increased by performing mismatch selection on the output from pseudoround 
4. In contrast the relative abundance of codons corresponding to building 
blocks which bind thrombin were not significantly increased by performing 
pseudoround 5 using the output from pseudoround 4. This demonstrates the 
advantage provided by the mismatch selection strategy in the attempt to 

25 identify molecules or molecular fragments which bind a given target, 

Example 10: Factors affecting MISE efficiency and use o f MISE to identify 
identifier oligonucleotides corresponding to disp lay molecules which bind a 
target from a library encoding 10 A 10 small mo lecule soecies. 

30 
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In this example a (hypothetical) library containing 10*10 small molecules is 
provided by the method outlined in example 8. 

With libraries of this size it is likely that a single step of contacting the library 
5 with a target is not sufficient to partition binders from non-binders to an 
extend that decoding identifier oligonucleotides corresponding to binding 
small molecules is practically feasible, e.g., it might be necessary to decode 
more than 1 .000.000 identifier oligonucleotides to observe that the relative 
abundance of a specific identifier oligonucleotide corresponding to a target- 
1 0 binding display molecule is significantly different from the average relative 
abundances of identifier oligonucleotides corresponding to display 
molecules with insignificant affinity for target. 

An attempt to increase the feasibility of decoding might include performing 
1 5 multiple rounds of contacting a target with the library followed by partitioning 
of the bound library fraction, i.e., to perform pseudorounds as described in 
example 9. But as demonstrated in example 9 it is probable that even after 
performing several pseudorounds (i.e. 4 pseudorounds in example 9) no 
identifier oligonucleotide is significantly more abundant than other identifier 
20 oligonucleotide when performing extensive decoding of identifier 
oligonucleotides by PCR, cloning and sequencing. 

In this case the mismatch selection strategy provides a method for 
enriching, i.e., increasing the relative abundance of identifier 
25 oligonucleotides corresponding to display molecules which bind a target to 
an extent that identification of these oligonucleotides becomes feasible 
• using PCR, cloning and sequencing, e.g., by decoding a total of 100, 1000, 
or 10.000 identifier oligonucleotides. 

30 The efficiency of mismatch selection is dependent on a number of probable 
factors. One such factor is the extend to which one or more rounds of 
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contacting the library with target followed by partitioning of the target-binding 
library fraction alters the relative abundances of the identifier 
oligonucleotides corresponding to display molecules that bind target. This is 
again dependent on the display molecules present in the library and the 
5 identity of the target used. Furthermore, a library may contain many binders 
to a specific target type but essentially no binders to other target types. The 
efficiency of MISE also depends on the efficiency of degrading or removing 
heteroduplexes while retaining homoduplexes during the MISE procedure 
as describe in example 1 calculated example C. 

10 

Although a single round of MISE can provide significant enrichment of a 
specific identifier oligonucleotide as shown in examples 4 and 8 it is 
possible and indeed very easy and affordable both in terms of time and 
monetary cost to perform multiple rounds of MISE as described in example 
15 5. 

It is likely that performing pseudorounds on a library containing 1 0 A 1 0 small 
molecules can result in a 1.000.000-fold enrichment, i.e., a 1.000.000-fold 
increase in the relative abundance, of a specific identifier oligonucleotide 
20 corresponding to a display molecule which binds target compared to 

identifier oligonucleotides which do not bind target significantly. Because a 
library initially containing 10 A 10 individual display molecule species is likely 
to contain a large number of display molecule species that bind a given 
target it is likely that further pseudorounding at some point will fail to enrich 
25 a specific identifier oligonucleotide. That is the relative abundances of 
identifier oligonucleotides corresponding to display molecules with 
significant target binding affinity will remain essentially unaltered in the 
library fraction recovered from pseudoround to pseudoround. Such a 
situation is described in example 9. However, it is likely that homo- and 
30 heteroduplexes can be recovered from such a library fraction during the 

MISE method with efficiencies of 50 % and 0.1 %, respectively. Importantly, 
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such recovery efficiencies can be easily manipulated and optimized/Using 
these values in a calculation as outlined in example 1 results in the 
enrichment of a specific target binding display molecule of 1 0.000 fold with 
six rounds of MISE. This results in a total enrichment of 10.000.000.000 fold 

5 by combining the enrichment fold achievable by pseudorounding with the 
enrichment fold achievable by MISE. As the majority or essentially all of the 
remaining identifier oligonucleotides obtained after 6 rounds of MISE will 
correspond to a display molecule which binds target it becomes easy to 
identify this identifier oligonucleotide by PCR, cloning and sequencing. It is 

1 0 possible to decode identifier oligonucleotides by PCR, cloning and 

sequencing after each MISE step and thereby follow . the MISE enrichment 
process and identify a diverse set of display molecules with affinity for 
target. 

15 importantly, the ability of MISE to ease or indeed make possible the 
identification of identifier oligonucleotides corresponding to display 
molecules that bind target is not dependent on initial use of several 
- pseudorounds. The only prerequisite for MISE to enable such identification 
is that the relative abundances of identifier oligonucleotides corresponding 

20 to target-binding display molecules is increased from that in the initial 

library, e.g., by contacting the library with the target and recovering a target- 
bound library fraction. 
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